Limit theorems for random spatial drainage networks 



Mathew D. Penrose
Department of Mathematical Sciences, University of Bath,
Claverton Down, Bath BA2 7AY, England.

Andrew R. Wade
Department of Mathematics, University of Bristol,
University Walk, Bristol BS8 1TW, England.

21st January 2009 

Abstract
[...] flow with nonnegative component in one of the canonical orthogonal basis directions
of $\mathbb{R}^d$, $d \geq 2$. The resulting network is a version of the so-called minimal directed
spanning tree. We give laws of large numbers and convergence in distribution results
on the large-sample asymptotic behaviour of the total power-weighted edge-length of
the network on uniform random points in $(0,1)^d$. The distributional results exhibit
a weight-dependent phase transition between Gaussian and boundary-effect-derived
distributions. These boundary contributions are characterized in terms of limits of the
so-called on-line nearest-neighbour graph, a natural model of spatial network evolution,
for which we also present some new results. Also, we give a convergence in distribution
result for the length of the longest edge in the drainage network; when $d = 2$, the limit
is expressed in terms of Dickman-type variables.

1 Introduction



We consider a continuum model of drainage through a porous medium in $\mathbb{R}^d$
($d \in \mathbb{N} := \{1, 2, 3, \ldots\}$), which we first describe informally. Let
$\{\mathbf{e}_1, \ldots, \mathbf{e}_d\}$ be the canonical orthonormal basis of $\mathbb{R}^d$. We
distinguish the $\mathbf{e}_d$ direction and suppose that 'gravity' acts in direction
$-\mathbf{e}_d$; in free space, liquid would fall in exactly the $-\mathbf{e}_d$ direction.

Informally, consider a unit $d$-cube, representing a block of porous material. We scatter
a certain finite set $\mathcal{X}$ of points in this cube, representing special sites in the medium. We
constrain liquid to drain in channels that visit every site and travel in straight lines from site
to site. The vector of each channel must have a non-positive component in the $\mathbf{e}_d$ direction;
that is, it must respect gravity. We call a collection of channels spanning $\mathcal{X}$ and satisfying these
conditions a drainage network on $\mathcal{X}$. A natural question is to find the most efficient
arrangement of channels satisfying the above constraints, i.e., a drainage network that is in
some sense optimal. As we shall see, an answer to this question is a version of the so-called
minimal directed spanning tree (MDST for short) on the vertices $\mathcal{X}$.

More mathematically, let $\mathcal{X}$ be a finite point set in $(0,1)^d$ whose points have distinct $d$-th
coordinates. We construct a directed graph on vertex set $\mathcal{X}$ as follows. Join each vertex
$\mathbf{x} \in \mathcal{X}$ by a directed edge to a Euclidean nearest neighbour (if one exists, and arbitrarily
breaking any ties) amongst those points $\mathbf{y} \in \mathcal{X} \setminus \{\mathbf{x}\}$ such that
$\mathbf{y} \preccurlyeq^* \mathbf{x}$. Here $\preccurlyeq^*$ is the order on $\mathcal{X}$ induced by the order on
$d$-th coordinates: $(x_1, \ldots, x_d) \preccurlyeq^* (y_1, \ldots, y_d)$ if and only if $x_d \leq y_d$. We call
the directed graph so constructed the MDST on $(\mathcal{X}; \preccurlyeq^*)$; it is a
mathematical solution to the problem of constructing a minimal-length drainage network on
$\mathcal{X}$ as informally described above.

The subject of this paper is the MDST on $(\mathcal{P}_n; \preccurlyeq^*)$ where $\mathcal{P}_n$ is a homogeneous Poisson
point process of intensity $n > 0$ on $(0,1)^d$. Then (with probability 1), $\mathcal{P}_n$ is indeed a finite
point set with distinct $d$-th coordinates, so that the MDST is almost surely well-defined. We
study the total power-weighted edge-length of the MDST on $(\mathcal{P}_n; \preccurlyeq^*)$ as $n \to \infty$, and also
the length of the longest edge.

The MDST on $(\mathcal{P}_n; \preccurlyeq^*)$ is an example of a random spatial graph, that is, a graph
generated by scattering points randomly into a region of space and connecting them according to
some prescribed rule. Motivated in part by real-world networks with spatial content, such as
communications networks (including the Internet), social networks, and physical networks,
a substantial body of recent research has dealt with the large-sample asymptotic theory of
such graphs. Examples include the geometric graph, the nearest-neighbour graph, and the
minimal-length spanning tree (MST). See, for example, [3,14,17,18,20,21,26,27,31,34].

A feature that distinguishes the MDST considered here from other random spatial graphs
is that the constraint on direction of the edges can lead to significant (indeed, sometimes
dominating) boundary effects due to the possibility of long edges occurring near the lower
boundary cube $(0,1)^{d-1}$ orthogonal to $\mathbf{e}_d$. Another difference is the fact that there is no
uniform upper bound on vertex degrees in the MDST.

In general, the MDST can be defined on any finite partially ordered set in $\mathbb{R}^d$, as described
in [22]; a survey of results on the random MDST is given in [25]. Examples considered
previously are the 'coordinate-wise' (or 'South-West') partial ordering on point sets in $(0,1)^2$
[7,22,23] or in $(0,1)^d$ [5], and the radial spanning tree [4] on point sets in $\mathbb{R}^2$. Also, laws of
large numbers for the MDST on a class of partial orders on $\mathbb{R}^2$ were given in [32].

In this paper we are concerned with the 'South' partial order $\preccurlyeq^*$, which is even a total
order, on point sets in $\mathbb{R}^d$ with distinct $d$-coordinates. Our main results, Theorems 2.1 and
2.2, give laws of large numbers, convergence of expectation, and distributional convergence
results for the total power-weighted edge-length of the MDST on $(\mathcal{P}_n; \preccurlyeq^*)$ for $d \geq 2$. We also
give a convergence result for the maximum edge-length in the MDST (Theorem 2.3). Our
main distributional limit result, Theorem 2.2, reveals two regimes of limit behaviour for the
total power-weighted edge-length depending on the power-weighting, in which the limit law
is either purely normal or given in terms of boundary effects characterized as distributional
limits of certain on-line nearest-neighbour graphs. At a critical point between these two
regimes, there is a phase transition at which both effects contribute significantly to the limit
law. In order to understand the boundary effects in the MDST, and its longest edge, we
make use of the fact that near to the boundary, the MDST is well-approximated by a certain
on-line nearest-neighbour graph.

In the on-line nearest-neighbour graph (ONG), each point after the first in a sequence
of points arriving sequentially in $\mathbb{R}^d$ is joined to its nearest neighbour amongst those points
already present. The ONG itself is of separate interest as a simple growth model for random
networks, such as the world wide web graph (see [6]). The total power-weighted length of
the ONG has been studied in [18,24,32,33]. In the present paper, the ONG arises as a
natural tool for studying the structure of the MDST near to the boundary; we also prove a
new result (Theorem 3.1) on the length of the longest edge in the ONG on uniform random
points in $(0,1)^d$.

In the particular case of the total weight of the MDST on $(\mathcal{P}_n; \preccurlyeq^*)$ when $d = 2$, which
is one of the most natural cases, the boundary contributions to the total power-weighted
edge-length limit laws can be characterized in terms of the limiting distribution of the total
weight of the one-dimensional ONG (centred as necessary). Results from [24] say that such
a distribution is characterized by a distributional fixed-point equation. Such fixed-point
equations, and the 'divide and conquer' algorithms from which they often arise, are also a
subject of considerable recent interest; see, for example, [2,16,29].

Mathematically, much of the motivating interest comes from the desire to further under- 
stand the interplay between stochastic geometry and distributional fixed points previously 
more commonly seen in the analysis of algorithms (see e.g. [16]). This relationship was 
first seen in our previous work [23,32] on limit theorems for the length of the 'South-West'
MDST in the unit square. The present work adds to this by considering the 'South' MDST, 
for which the fixed-point distributions which arise are different. We remain some way from 
having a full description of the limits for all possible partial orders, other shapes of domain 
and non-uniform densities. 

In [23,32], only the case $d = 2$ of the 'South-West' MDST was studied. In the present
paper we deal also with higher dimensions. With fairly straightforward modifications, the 
method used in [23] could be adapted to prove the d = 2 case of our Theorem 2.2 below. 
However, at several points the proofs used in [23] are not easily adapted to higher dimensions, 
and thus we have adopted different proofs; sometimes these improve or extend ideas from [23] 
and sometimes we use entirely different techniques. Another difference is that [23, 32] made 
use of general results of Penrose and Yukich [26, 27] while in the present paper we instead 
use the results of Penrose [20,21] (see also [19]) which are in several ways more convenient 
for the current application. Thus the results of the present paper are of a similar (albeit
general-dimensional) flavour to those in [23,32], but the proofs are different.

Before describing our results in detail, we return to the question of motivation. General 
motivation for the MDST is as a model for a constrained optimal transport network (see 
e.g. [25]). As has been mentioned elsewhere (e.g. [7]), the MDST can be motivated by
communications networks. However, in the present case the primary motivation is from drainage
networks. From this point of view, our choice of 'South' partial ordering seems the most
natural, and the two most natural choices of $d$ are $d = 2$ and $d = 3$. For further references
on the mathematical modelling of drainage networks, and a related infinite lattice version 
of this model, for which rather different properties were studied, see [12]; for background on 
modelling of drainage networks in general, see also [28]. 

2 Statement of results 

In this section we give formal definitions of our model and state our main results. Let
$d \in \mathbb{N}$. Let $\mathcal{X}$ be a finite subset of $\mathbb{R}^d$ endowed with the binary relation $\preccurlyeq^*$, for which
$(x_1, \ldots, x_d) \preccurlyeq^* (y_1, \ldots, y_d)$ if and only if $x_d \leq y_d$. Assume that all the elements of $\mathcal{X}$
have distinct $x_d$-coordinates. Under this assumption, $\preccurlyeq^*$ is a partial order on $\mathcal{X}$ (in fact,
a total order), and so the MDST that we shall construct fits into the theory of the MDST
on partially ordered sets given in [22,25]. Let $\mathrm{card}(\mathcal{X})$ denote the cardinality (number of
elements) of the set $\mathcal{X}$.

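A minimal element, or sink, is a vertex $\mathbf{x} \in \mathcal{X}$ for which there exists no
$\mathbf{y} \in \mathcal{X} \setminus \{\mathbf{x}\}$ such that $\mathbf{y} \preccurlyeq^* \mathbf{x}$. Thus under our definition of
$\preccurlyeq^*$ and our assumption on $\mathcal{X}$, there is a unique sink, which has strictly minimal
$x_d$-coordinate and which we shall denote $m(\mathcal{X})$.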

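For a vertex $\mathbf{x} \in \mathcal{X} \setminus \{m(\mathcal{X})\}$, we say that $\mathbf{y} \in \mathcal{X} \setminus \{\mathbf{x}\}$ is a
directed nearest neighbour (in the $\preccurlyeq^*$-sense) of $\mathbf{x}$ with respect to $\mathcal{X}$ if
$\mathbf{y} \preccurlyeq^* \mathbf{x}$ and

$$\|\mathbf{x} - \mathbf{y}\|_d \leq \|\mathbf{x} - \mathbf{y}'\|_d \quad \text{for all } \mathbf{y}' \in \mathcal{X} \setminus \{\mathbf{x}\} \text{ with } \mathbf{y}' \preccurlyeq^* \mathbf{x};$$

here and subsequently $\|\cdot\|_d$ denotes the Euclidean norm on $\mathbb{R}^d$. For each
$\mathbf{x} \in \mathcal{X} \setminus \{m(\mathcal{X})\}$ let $n_{\mathbf{x}} := n(\mathbf{x}; \mathcal{X})$ denote a directed nearest neighbour of
$\mathbf{x}$ with respect to $\mathcal{X}$, chosen arbitrarily if $\mathbf{x}$ has more than one directed nearest neighbour.
A minimal directed spanning tree (MDST) on $(\mathcal{X}; \preccurlyeq^*)$, or simply 'on $\mathcal{X}$' from now on, is a
directed graph with vertex set $\mathcal{X}$ and edge set
$\{(\mathbf{x}, n_{\mathbf{x}}) : \mathbf{x} \in \mathcal{X} \setminus \{m(\mathcal{X})\}\}$. That is, there is an edge from each point
other than the sink to a directed nearest neighbour. Hence, ignoring the directedness of
the edges, an MDST on $\mathcal{X}$ is a tree rooted at the sink $m(\mathcal{X})$. Note that an MDST is also
a solution to a global optimization problem (see [7,22]): find a minimal-length
spanning tree (ignoring directedness of the edges) such that each vertex is connected to the
sink by a unique directed path, where directed edges must respect $\preccurlyeq^*$.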

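Figure 1: Realizations of the MDST under $\preccurlyeq^*$ on 50 simulated uniform random points in
$(0,1)^2$ (left) and $(0,1)^3$ (right).

For $\mathcal{X} \subset \mathbb{R}^d$ with $\mathrm{card}(\mathcal{X}) \geq 2$, let $d_*(\mathbf{x}; \mathcal{X})$ denote the Euclidean distance from a
non-minimal $\mathbf{x} \in \mathcal{X}$ to a directed nearest neighbour $n(\mathbf{x}; \mathcal{X})$ under $\preccurlyeq^*$, and set
$d_*(m(\mathcal{X}); \mathcal{X}) = 0$. For $d \in \mathbb{N}$ and $\alpha > 0$, define the total power-weighted edge-length of
the MDST on $\mathcal{X}$ by

$$\mathcal{L}^{d,\alpha}(\mathcal{X}) := \sum_{\mathbf{x} \in \mathcal{X}} d_*(\mathbf{x}; \mathcal{X})^{\alpha},$$

where an empty sum is 0. In particular, $\mathcal{L}^{d,1}(\mathcal{X})$ is the total Euclidean length of the MDST
on $\mathcal{X}$. Also, define the centred version
$\tilde{\mathcal{L}}^{d,\alpha}(\mathcal{X}) := \mathcal{L}^{d,\alpha}(\mathcal{X}) - E[\mathcal{L}^{d,\alpha}(\mathcal{X})]$.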
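
The construction just described is straightforward to carry out on a finite sample. The following is a minimal Python sketch, not code from the paper: the function names `mdst_edges` and `total_weight` are hypothetical, points are represented as tuples, and the quadratic-time search is chosen for clarity rather than speed. It joins every point except the sink to a nearest point with smaller last coordinate and then sums the edge lengths raised to a chosen power.

```python
import math
import random


def mdst_edges(points):
    """Edges (i, j) of a 'South' MDST: point i is joined to a nearest point j below it.

    `points` is a list of d-tuples with distinct last coordinates (an assumption
    mirroring the distinct-coordinate condition in the text).
    """
    # Sort indices by last coordinate; candidates for a point are those sorted before it.
    order = sorted(range(len(points)), key=lambda i: points[i][-1])
    edges = []
    for rank in range(1, len(order)):  # rank 0 is the sink, which has no outgoing edge
        i = order[rank]
        # min() keeps the first minimiser, which breaks any ties arbitrarily.
        j = min(order[:rank], key=lambda k: math.dist(points[i], points[k]))
        edges.append((i, j))
    return edges


def total_weight(points, alpha):
    """Total power-weighted edge-length: sum of (edge length)**alpha; an empty sum is 0."""
    return sum(math.dist(points[i], points[j]) ** alpha for i, j in mdst_edges(points))


if __name__ == "__main__":
    rng = random.Random(0)
    sample = [(rng.random(), rng.random()) for _ in range(50)]
    print(total_weight(sample, 1.0))
```

Sorting once by the last coordinate means the candidates for each point are exactly the points that precede it in the sorted order, so the sink is simply the first point in that order.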

From now on we will take $\mathcal{X}$ to be a random point set in $(0,1)^d$. In particular, we will
take a homogeneous Poisson point process $\mathcal{P}_n$ of intensity $n$ on $(0,1)^d$. Note that in this
random setting, each point of $\mathcal{P}_n$ almost surely has a unique $x_d$-coordinate and at most one
directed nearest neighbour under $\preccurlyeq^*$, so that $\mathcal{P}_n$ has a unique MDST, which is rooted at
$m(\mathcal{P}_n)$.

We state and prove all of our main results in the present paper for the Poisson process $\mathcal{P}_n$.
In all cases, the authors believe that analogous results hold for the binomial point process
consisting of $n$ independent uniform random points on $(0,1)^d$ instead; it should be possible
to use standard de-Poissonization arguments (such as applied in similar circumstances in
[22,23]) to verify this.

In the present paper we are concerned with $d \geq 2$. When $d = 1$, $\preccurlyeq^*$ coincides with the
coordinatewise partial order (and indeed the total order $\leq$ on $\mathbb{R}$) and so our 'South'
MDST is the same as the 'South-West' MDST here. Moreover, $\mathcal{L}^{1,\alpha}(\mathcal{P}_n)$ is a sum of powers
of spacings of uniform points, and it can be studied using standard Dirichlet spacings results
(see e.g. [8,9]). For instance, Darling (see [9], p. 245) essentially gives a central limit theorem
for the binomial point process analogue of $\mathcal{L}^{1,\alpha}(\mathcal{P}_n)$. From now on we fix $d \in \{2, 3, \ldots\}$.

Our first result describes the first-order behaviour of $\mathcal{L}^{d,\alpha}(\mathcal{P}_n)$ as $n \to \infty$. In particular,
we have a law of large numbers for $\alpha \in (0, d)$, and also asymptotic results for the expectation
when $\alpha \geq d$. In $d = 2$, the binomial point process version of Theorem 2.1(i) is contained in
the $\phi = \pi$ case of Theorem 5 of [32]. For $d \in \mathbb{N}$, let

$$v_d := \pi^{d/2} \left[ \Gamma(1 + (d/2)) \right]^{-1}, \tag{1}$$

the volume of the unit $d$-ball (see e.g. [13] equation (6.50)); here $\Gamma(\cdot)$ denotes the Euler
Gamma function.

Theorem 2.1 Suppose $d \in \{2, 3, 4, \ldots\}$.



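(i) Suppose $\alpha \in (0, d)$. Then as $n \to \infty$,

$$n^{(\alpha/d)-1} \mathcal{L}^{d,\alpha}(\mathcal{P}_n) \xrightarrow{L^1} 2^{\alpha/d} v_d^{-\alpha/d} \Gamma(1 + (\alpha/d)). \tag{2}$$

(ii) Suppose $\alpha \geq d$. Then there exists $\mu'(d, \alpha) \in (0, \infty)$ such that, as $n \to \infty$,

$$E[\mathcal{L}^{d,\alpha}(\mathcal{P}_n)] \to \mu'(d, \alpha). \tag{3}$$

Moreover, we can express

$$\mu'(d, \alpha) = \mu(d-1, \alpha) + \mathbf{1}_{\{\alpha = d\}} 2 v_d^{-1},$$

where constants $\mu(d-1, \alpha) \in (0, \infty)$ can be characterized in terms of limits of certain
on-line nearest-neighbour graphs: $\mu(\cdot, \cdot)$ is as given in Proposition 2.1 of [33]; see (71)
below. In particular, for $\alpha \geq 2$,

$$\mu(1, \alpha) = \frac{2}{\alpha(\alpha+1)} \left( 1 + \frac{2^{-\alpha}}{\alpha - 1} \right).$$

One can generalize the statement of Theorem 2.1(i) to more general point processes under
certain conditions; see [19, 20] for a general framework.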
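
The limiting constant in (2) is easy to evaluate numerically. The sketch below uses hypothetical helper names (`unit_ball_volume`, `lln_limit`) and only the standard library; it computes $v_d$ from (1) and the right-hand side of (2), read as $2^{\alpha/d} v_d^{-\alpha/d}\Gamma(1+(\alpha/d))$. A heuristic reading of this constant, offered as intuition rather than proof: in a unit-intensity Poisson process only the lower half-ball around a point can contain a directed nearest neighbour, so the directed nearest-neighbour distance $D$ satisfies $P(D > r) = \exp(-v_d r^d/2)$, and hence $E[D^\alpha] = (2/v_d)^{\alpha/d}\,\Gamma(1+(\alpha/d))$.

```python
import math


def unit_ball_volume(d):
    """Volume v_d of the unit d-ball, as in equation (1)."""
    return math.pi ** (d / 2) / math.gamma(1 + d / 2)


def lln_limit(d, alpha):
    """Right-hand side of (2): 2^(alpha/d) * v_d^(-alpha/d) * Gamma(1 + alpha/d)."""
    if not 0 < alpha < d:
        raise ValueError("the law of large numbers (2) is stated for 0 < alpha < d")
    return (2 / unit_ball_volume(d)) ** (alpha / d) * math.gamma(1 + alpha / d)


if __name__ == "__main__":
    # For d = 2 and alpha = 1 the constant simplifies to 1/sqrt(2).
    print(lln_limit(2, 1.0))
```

For example, with $d = 2$ and $\alpha = 1$ one has $v_2 = \pi$ and $\Gamma(3/2) = \sqrt{\pi}/2$, so the constant equals $1/\sqrt{2}$.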



Our second main result (Theorem 2.2, below) presents convergence in distribution results
for $\tilde{\mathcal{L}}^{d,\alpha}(\mathcal{P}_n)$; the distributional limits contain Gaussian random variables and also random
variables defined as distributional limits of certain on-line nearest-neighbour graphs (see
Section 3). In general we do not give an explicit description of the latter distributions.
However, in the case of $d = 2$, the limits in question can be characterized as solutions to
distributional fixed-point equations, which we describe at the end of this section.

We now state our main convergence in distribution result. Let $\mathcal{N}(0, s^2)$ denote the normal
distribution with mean zero and variance $s^2 \geq 0$; included is the degenerate case $\mathcal{N}(0, 0)$.
By '$\xrightarrow{d}$' we denote convergence in distribution.

Theorem 2.2 Suppose $d \in \{2, 3, 4, \ldots\}$ and $\alpha > 0$. Then there exists a constant
$s_\alpha^2 \in [0, \infty)$ such that, for a normal random variable $W_\alpha \sim \mathcal{N}(0, s_\alpha^2)$, as $n \to \infty$:

$$n^{(\alpha/d)-(1/2)} \tilde{\mathcal{L}}^{d,\alpha}(\mathcal{P}_n) \xrightarrow{d} W_\alpha \quad (0 < \alpha < d/2);$$
$$\tilde{\mathcal{L}}^{d,d/2}(\mathcal{P}_n) \xrightarrow{d} W_{d/2} + Q(d-1, d/2);$$
$$\tilde{\mathcal{L}}^{d,\alpha}(\mathcal{P}_n) \xrightarrow{d} Q(d-1, \alpha) \quad (\alpha > d/2).$$

Here the $Q(d-1, \alpha)$ are mean-zero random variables as given in Lemma 3.2 below and
independent of the $W_\alpha$; in particular $Q(1, \alpha) = G_\alpha$ for $\alpha \geq 1$, where $G_\alpha$ has the distribution
given by (7) or (8) below.

Remarks. (a) It can be shown that the limiting variance $s_\alpha^2$ of the normal component in
the above limits is strictly positive for $\alpha > 0$, using, for example, techniques similar to those
in [3] or [27] (see Lemma 6.2 of the extended version of [23] for an example of such a result
for a different MDST model).

(b) The normal random variables $W_\alpha$ arise from the edges away from the lower boundary
of the $d$-cube (see Section 4.2). The variables $Q(d-1, \alpha)$ arise from the edges very close
to the boundary, where the MDST is asymptotically close to a $(d-1)$-dimensional on-line
nearest-neighbour graph: this is formalized in Section 5 below.

(c) Theorem 2.2 indicates a phase transition in the character of the limit law as $\alpha$ increases.
The normal contribution dominates for $\alpha \in (0, d/2)$, while the boundary contribution
dominates for $\alpha > d/2$. In the critical case $\alpha = d/2$ (such as the natural case $d = 2$ and
$\alpha = 1$) neither effect dominates and both terms contribute significantly to the asymptotic
behaviour. The intuition here is that increasing $\alpha$ increases the relative importance of long
edges, such as, typically, those near to the boundary.

(d) As will be demonstrated below (see Lemma 3.2), the random variables $Q(d-1, \alpha)$
can be characterized as distributional limits of certain on-line nearest-neighbour graphs.
It is known (see [24]) that the $Q(d-1, \alpha)$ are non-Gaussian for $\alpha > d - 1$. When $d = 2$
much more is known (see [24]); $Q(1, \alpha)$ can be characterized in terms of a distributional
fixed-point equation (see (7) and (8) below). In particular, $Q(1, \alpha)$ is non-Gaussian for
$\alpha \geq 1$. The authors suspect that for general $d$, $Q(d-1, \alpha)$ is in fact non-Gaussian for all
$\alpha \geq d/2$.



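Theorem 2.3 below gives a convergence in distribution result on the length of the longest
edge in the MDST on $(\mathcal{P}_n; \preccurlyeq^*)$. A similar result (in $d = 2$ only) for the longest edge in the
'South-West' MDST was given in [22]. Let $\mathcal{L}^d_{\max}(\mathcal{X})$ denote the length of the longest edge
in the MDST (under $\preccurlyeq^*$) on point set $\mathcal{X} \subset (0,1)^d$:

$$\mathcal{L}^d_{\max}(\mathcal{X}) := \max_{\mathbf{x} \in \mathcal{X}} d_*(\mathbf{x}; \mathcal{X}) = \max_{\mathbf{x} \in \mathcal{X} \setminus \{m(\mathcal{X})\}} \|\mathbf{x} - n(\mathbf{x}; \mathcal{X})\|_d.$$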



In the particular case $d = 2$, the distributional limit arising in Theorem 2.3 below is expressed
in terms of the max-Dickman distribution (named after Dickman's work [10] on the
asymptotic distribution of large prime factors), which can be characterized as the distribution of a
random variable $M$ satisfying the fixed-point equation

$$M \stackrel{d}{=} \max\{1 - U, UM\}, \tag{4}$$

where $U$ is uniform on $(0,1)$ and independent of the $M$ on the right. (Here and subsequently
'$\stackrel{d}{=}$' denotes equality in distribution.) See [22,25] and references therein for more information
on the max-Dickman distribution; it has appeared in many contexts, and a picture of part
of its density function is on the front cover of the second edition of Billingsley's book [8].
In particular, we note that $M$ can be characterized as the first component of the
Poisson-Dirichlet distribution with parameter 1, and $E[M] \approx 0.6243299$ is Dickman's constant (see
[10] p. 9).
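
The fixed-point equation (4) can be unrolled into a 'stick-breaking' maximum: with $V_1, V_2, \ldots$ independent and uniform on $(0,1)$, the variable $\max\{1-V_1, V_1(1-V_2), V_1V_2(1-V_3), \ldots\}$ satisfies (4). The following Python sketch (the function name `sample_max_dickman` is hypothetical, and this is an illustration rather than the authors' procedure) samples from this representation. It stops as soon as the remaining stick length cannot beat the current maximum, so each sample is exact up to floating-point arithmetic.

```python
import random


def sample_max_dickman(rng):
    """One draw of max{1-V1, V1(1-V2), V1 V2 (1-V3), ...} with V_i iid uniform(0,1)."""
    best = 0.0       # largest piece broken off so far
    remaining = 1.0  # product V_1 ... V_{i-1}: the part of the stick not yet broken
    while remaining > best:  # every later piece is at most `remaining`
        v = rng.random()
        best = max(best, remaining * (1.0 - v))
        remaining *= v
    return best


if __name__ == "__main__":
    rng = random.Random(2009)
    draws = [sample_max_dickman(rng) for _ in range(100000)]
    # The sample mean should be close to Dickman's constant, roughly 0.6243.
    print(sum(draws) / len(draws))
```

The early-stopping rule is a design choice that avoids truncating the infinite maximum at an arbitrary depth.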

Theorem 2.3 Let $d \in \{2, 3, \ldots\}$. There exists a random variable $Q_{\max}(d-1)$ such that

$$\mathcal{L}^d_{\max}(\mathcal{P}_n) \xrightarrow{d} Q_{\max}(d-1),$$

as $n \to \infty$. Moreover, $Q_{\max}(d-1)$ is characterized in terms of the ONG (see Theorem 3.1
below); in particular

$$Q_{\max}(1) \stackrel{d}{=} \max\{U M^{(1)}, (1-U) M^{(2)}\},$$

where $U$, $M^{(1)}$ and $M^{(2)}$ are independent random variables, $U$ is uniform on $(0,1)$, and
$M^{(1)}$ and $M^{(2)}$ have the max-Dickman distribution as given by (4).

We will derive Theorem 2.3 from a new result on the limiting distribution of the length
of the longest edge in the ONG on uniform random points in $(0,1)^d$, which is of some
independent interest: see Theorem 3.1 below.

As promised, we now give a characterization of the limits $Q(1, \alpha)$, $\alpha \geq 1$, arising in the
$d = 2$ case of Theorem 2.2. First we define random variables $J_\alpha$, $\alpha > 1/2$, with $E[J_\alpha] = 0$
and $E[J_\alpha^2] < \infty$. Define $J_1$ by the fixed-point equation

$$J_1 \stackrel{d}{=} \min\{U, 1-U\} + U J_1^{(1)} + (1-U) J_1^{(2)} + \frac{U}{2} \log U + \frac{1-U}{2} \log(1-U), \tag{5}$$

and for $\alpha \in (1/2, \infty) \setminus \{1\}$, define $J_\alpha$ by the fixed-point equation

$$J_\alpha \stackrel{d}{=} \min\{U, 1-U\}^{\alpha} + U^{\alpha} J_\alpha^{(1)} + (1-U)^{\alpha} J_\alpha^{(2)} + \frac{2^{-\alpha}}{\alpha - 1} \left( U^{\alpha} + (1-U)^{\alpha} - 1 \right). \tag{6}$$

In each of these two equations (and subsequently), $Y^{(1)}$ and $Y^{(2)}$ denote independent copies
of the random variable $Y$, and $U$ denotes a uniform random variable on $(0,1)$ independent
of the other random variables on the right-hand side of the equation.

Note that (5) and (6) define unique square-integrable mean-zero solutions (see e.g.
Theorem 3 of Rösler [29]), and hence the distributions of $J_1$ and $J_\alpha$ are uniquely defined. Moments
of $J_\alpha$ can be calculated recursively from (5) and (6); see [24] for some information on the
first few moments of $J_1$, for example. From these moments one can deduce that $J_\alpha$, $\alpha > 1/2$,
is not Gaussian.
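
As a consistency check on the constants (a worked computation added for the reader, assuming the deterministic coefficient in (6) is $2^{-\alpha}/(\alpha-1)$ and the logarithmic terms in (5) carry the factor $1/2$), one can take expectations on the right-hand sides of (5) and (6). The terms involving the independent mean-zero copies of $J_\alpha$ vanish, and the remaining terms cancel exactly, as the mean-zero requirement demands:

```latex
\begin{align*}
E\big[\min\{U,1-U\}^{\alpha}\big] &= 2\int_0^{1/2} u^{\alpha}\,du = \frac{2^{-\alpha}}{\alpha+1},
\qquad
E[U^{\alpha}] = E[(1-U)^{\alpha}] = \frac{1}{\alpha+1},\\[4pt]
\frac{2^{-\alpha}}{\alpha+1} + \frac{2^{-\alpha}}{\alpha-1}\Big(\frac{2}{\alpha+1}-1\Big)
&= \frac{2^{-\alpha}}{\alpha+1} - \frac{2^{-\alpha}}{\alpha+1} = 0
\qquad (\alpha \neq 1),\\[4pt]
E\big[\min\{U,1-U\}\big] + \tfrac12 E[U\log U] + \tfrac12 E[(1-U)\log(1-U)]
&= \tfrac14 - \tfrac18 - \tfrac18 = 0
\qquad (\alpha = 1),
\end{align*}
```

using $E[U \log U] = \int_0^1 u \log u \, du = -1/4$ in the last line.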

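Now we can define random variables $H_\alpha$, $G_\alpha$, again with zero mean and finite variance.
Define $H_1$ by

$$H_1 \stackrel{d}{=} U J_1 + (1-U) H_1 + \frac{U}{2} + \frac{U}{2} \log U + \frac{1-U}{2} \log(1-U).$$

For $\alpha \in (1/2, \infty) \setminus \{1\}$, define $H_\alpha$ by

$$H_\alpha \stackrel{d}{=} U^{\alpha} J_\alpha + (1-U)^{\alpha} H_\alpha + U^{\alpha} \left( 1 + \frac{2^{-\alpha}}{\alpha-1} \right) + \left( (1-U)^{\alpha} - 1 \right) \left( \frac{1}{\alpha} + \frac{2^{-\alpha}}{\alpha(\alpha-1)} \right).$$

Define $G_1$ by

$$G_1 \stackrel{d}{=} U H_1^{(1)} + (1-U) H_1^{(2)} + \frac{1}{4} + \frac{U}{2} \log U + \frac{1-U}{2} \log(1-U). \tag{7}$$

Finally, for $\alpha \in (1/2, \infty) \setminus \{1\}$, define $G_\alpha$ by

$$G_\alpha \stackrel{d}{=} U^{\alpha} H_\alpha^{(1)} + (1-U)^{\alpha} H_\alpha^{(2)} + \left( U^{\alpha} + (1-U)^{\alpha} - \frac{2}{\alpha+1} \right) \left( \frac{1}{\alpha} + \frac{2^{-\alpha}}{\alpha(\alpha-1)} \right). \tag{8}$$

Once again, the distributions of $H_\alpha$ and $G_\alpha$ are uniquely defined. It is the distribution of
$G_\alpha$ ($\alpha \geq 1$) as defined by (7) or (8) that appears in the $d = 2$ case of Theorem 2.2.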

In the remainder of this paper, we prove Theorems 2.1, 2.2 and 2.3. First, in Section 3 we
discuss the ONG, which we use to deal with the boundary effects in the MDST, and prove
some new results, which are of some independent interest. In Section 4 we apply general
results of Penrose [20,21] (see also [19]) to prove a law of large numbers and central limit
theorem for the total weight of the MDST away from the boundary. In Section 5 we deal
with the boundary effects themselves. Then in Section 6 we prove Theorem 2.3. Finally, we
complete the proofs of Theorem 2.2 in Section 7 and Theorem 2.1 in Section 8.

Throughout the sequel we make repeated use of Slutsky's theorem (see, e.g., Durrett [11],
p. 72), which says that for sequences of random variables $(X_n)$, $(Y_n)$ such that
$X_n \xrightarrow{d} X$ and $Y_n \xrightarrow{P} 0$ as $n \to \infty$, we have $X_n + Y_n \xrightarrow{d} X$ as $n \to \infty$.
(Here and subsequently '$\xrightarrow{P}$' denotes convergence in probability.)



3 The on-line nearest-neighbour graph 

In this section we describe the on-line nearest-neighbour graph that we use to analyse the
boundary effects in the total weight of the MDST under $\preccurlyeq^*$. Some of the results that we
will require are present in [24] and [33], but we will also need some new results, which we
prove in this section.

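Let $(\mathbf{Y}_1, \mathbf{Y}_2, \ldots)$ be a sequence of vectors in $\mathbb{R}^d$, and for $m \in \mathbb{N}$ set
$\mathcal{Y}_m := (\mathbf{Y}_1, \ldots, \mathbf{Y}_m)$. The on-line nearest-neighbour graph (ONG) on sequence $\mathcal{Y}_m$ is
constructed by joining each point after the first of $\mathcal{Y}_m$ by a directed edge to its (Euclidean)
nearest neighbour amongst those points that precede it in the sequence. That is, for
$i = 2, \ldots, m$ we include the edge $(\mathbf{Y}_i, \mathbf{Y}_j)$ where $j \in \{1, \ldots, i-1\}$ is such that

$$\|\mathbf{Y}_j - \mathbf{Y}_i\|_d = \min_{1 \leq k < i} \|\mathbf{Y}_k - \mathbf{Y}_i\|_d,$$

arbitrarily breaking any ties.

In this way we obtain the ONG on $\mathcal{Y}_m$, denoted $\mathrm{ONG}(\mathcal{Y}_m)$, which, ignoring
directedness of edges, is a tree rooted at $\mathbf{Y}_1$. Denote the total power-weighted edge-length with
exponent $\alpha > 0$ of $\mathrm{ONG}(\mathcal{Y}_m)$ by $\mathcal{O}^{d,\alpha}(\mathcal{Y}_m)$, that is

$$\mathcal{O}^{d,\alpha}(\mathcal{Y}_m) := \sum_{i=2}^{m} \min_{1 \leq j < i} \|\mathbf{Y}_i - \mathbf{Y}_j\|_d^{\alpha};$$

when $\mathcal{Y}_m$ is random, we denote the centred version by
$\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{Y}_m) := \mathcal{O}^{d,\alpha}(\mathcal{Y}_m) - E[\mathcal{O}^{d,\alpha}(\mathcal{Y}_m)]$.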
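
The ONG construction translates directly into code. The following minimal Python sketch (the names `ong_edges` and `ong_total_weight` are hypothetical, and the brute-force search over earlier points is chosen for clarity) builds the edges in arrival order and computes the total power-weighted edge-length. Points are tuples, so a one-dimensional point is written as a 1-tuple.

```python
import math


def ong_edges(seq):
    """Edges (i, j) with j < i of the ONG on the sequence `seq` of d-tuples."""
    edges = []
    for i in range(1, len(seq)):
        # Nearest neighbour among points that arrived earlier; ties go to the earliest.
        j = min(range(i), key=lambda k: math.dist(seq[i], seq[k]))
        edges.append((i, j))
    return edges


def ong_total_weight(seq, alpha):
    """Sum over i >= 2 of (distance from point i to its on-line nearest neighbour)**alpha."""
    return sum(math.dist(seq[i], seq[j]) ** alpha for i, j in ong_edges(seq))


if __name__ == "__main__":
    seq = [(0.5,), (0.9,), (0.6,), (0.1,)]
    print(ong_edges(seq))              # every later point joins the first one here
    print(ong_total_weight(seq, 1.0))  # edge lengths 0.4, 0.1 and 0.4
```

Unlike the MDST, the result depends on the order of arrival, so `seq` must be kept as an ordered sequence rather than a set.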

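Our primary interest is the case where $\mathcal{Y}_m$ is a sequence of uniform random vectors on
the $d$-cube. Let $d \in \mathbb{N}$. Let $(\mathbf{U}_1, \mathbf{U}_2, \ldots)$ be a sequence of independent uniform random
vectors in $(0,1)^d$. For $m \in \mathbb{N}$, set $\mathcal{U}_m := (\mathbf{U}_1, \ldots, \mathbf{U}_m)$. We then consider $\mathrm{ONG}(\mathcal{U}_n)$.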

We also consider the ONG defined on a Poisson number of points. Let $(N(t))_{t \geq 0}$ be the
counting process of a homogeneous Poisson process of unit rate in $(0, \infty)$, independent of
$(\mathbf{U}_1, \mathbf{U}_2, \ldots)$. Thus $N(n)$ is a Poisson random variable with mean $n$. With $\mathcal{U}_m$ as defined
above set $\Pi_n := \mathcal{U}_{N(n)}$; we then consider $\mathrm{ONG}(\Pi_n)$. Note that the points of the sequence $\Pi_n$
then constitute a homogeneous Poisson point process of intensity $n$ on $(0,1)^d$.
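
A minimal sketch of this Poissonized sampling scheme, using only the Python standard library (the names `poisson_count` and `poissonized_sequence` are hypothetical): $N(n)$ is obtained by counting the arrivals of a unit-rate Poisson process in $(0, n]$, generated from independent exponential inter-arrival times, and the sequence is then the first $N(n)$ uniform points. For simplicity the sketch draws the count first and then that many uniforms, which has the same distribution as truncating a pre-drawn infinite sequence.

```python
import random


def poisson_count(n, rng):
    """N(n): number of arrivals in (0, n] of a unit-rate Poisson process."""
    count = 0
    t = rng.expovariate(1.0)  # time of the first arrival
    while t <= n:
        count += 1
        t += rng.expovariate(1.0)  # independent exponential(1) inter-arrival time
    return count


def poissonized_sequence(n, d, rng):
    """The first N(n) points of an iid uniform sequence on the unit d-cube."""
    m = poisson_count(n, rng)
    return [tuple(rng.random() for _ in range(d)) for _ in range(m)]


if __name__ == "__main__":
    rng = random.Random(42)
    points = poissonized_sequence(100, 2, rng)
    print(len(points))  # a Poisson(100) number of points
```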
We need the following result, which is contained in Theorem 2.1 of [33]. 

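Lemma 3.1 Suppose $d \in \mathbb{N}$.

(i) For $\alpha \in (0, d/2)$, there exists a constant $C \in (0, \infty)$ such that for all $n \geq 1$

$$\mathrm{Var}[\mathcal{O}^{d,\alpha}(\Pi_n)] \leq C n^{1-(2\alpha/d)}.$$

(ii) For $\alpha = d/2$, there exists a constant $C \in (0, \infty)$ such that for all $n \geq 1$

$$\mathrm{Var}[\mathcal{O}^{d,d/2}(\Pi_n)] \leq C \log(1 + n).$$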

The following result is contained in Theorem 2.2 of [33], with Theorem 2.2 of [24] used 
to deduce the final statement about the d = 1 case. 

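Lemma 3.2 Suppose $d \in \mathbb{N}$ and $\alpha > d/2$. Then there exists a mean-zero random variable
$Q(d, \alpha)$ such that as $n \to \infty$

$$\tilde{\mathcal{O}}^{d,\alpha}(\Pi_n) \xrightarrow{d} Q(d, \alpha).$$

In particular, $Q(1, \alpha) = G_\alpha$ for $\alpha \geq 1$, where $G_\alpha$ has distribution given by (7) or (8).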



In order to deduce Theorem 2.3 we use the following result on the length of the longest edge
of the ONG on uniform random points in $(0,1)^d$, which adds to the analysis of the ONG
given in [6,18,24,32,33]. For a sequence $\mathcal{Y}_m = (\mathbf{Y}_1, \ldots, \mathbf{Y}_m)$ of points in $\mathbb{R}^d$, let
$\mathcal{O}^d_{\max}(\mathcal{Y}_m)$ denote the length of the longest edge in the ONG on $\mathcal{Y}_m$:

$$\mathcal{O}^d_{\max}(\mathcal{Y}_m) := \max_{2 \leq i \leq m} \min_{1 \leq j < i} \|\mathbf{Y}_i - \mathbf{Y}_j\|_d.$$

For $d = 1$, where $\mathcal{U}_n = (U_1, \ldots, U_n)$ and $\Pi_n = (U_1, \ldots, U_{N(n)})$ for $U_1, U_2, \ldots$ independent
uniform random variables on $(0,1)$, we set $\mathcal{U}_n^0 := (0, U_1, \ldots, U_n)$, i.e. $\mathcal{U}_n^0$ is $\mathcal{U}_n$ but with an
initial point placed at the origin, and similarly $\Pi_n^0 := (0, U_1, \ldots, U_{N(n)})$.

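Theorem 3.1 Let $d \in \mathbb{N}$.

(i) There exists a random variable $Q_{\max}(d)$ such that as $n \to \infty$

$$\mathcal{O}^d_{\max}(\mathcal{U}_n) \xrightarrow{d} Q_{\max}(d); \qquad \mathcal{O}^d_{\max}(\Pi_n) \xrightarrow{d} Q_{\max}(d).$$

(ii) When $d = 1$, we have in particular that

$$Q_{\max}(1) \stackrel{d}{=} \max\{U M^{(1)}, (1-U) M^{(2)}\}, \tag{9}$$

where $U$, $M^{(1)}$, $M^{(2)}$ are independent, $U$ is uniform on $(0,1)$ and $M^{(1)}$, $M^{(2)}$ are
max-Dickman random variables as given by (4). Also as $n \to \infty$

$$\mathcal{O}^1_{\max}(\mathcal{U}_n^0) \xrightarrow{d} M; \qquad \mathcal{O}^1_{\max}(\Pi_n^0) \xrightarrow{d} M, \tag{10}$$

where $M$ is a max-Dickman random variable as given by (4).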



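Proof. First we prove part (i). With probability 1, for all $n$,
$0 \leq \mathcal{O}^d_{\max}(\mathcal{U}_n) \leq d^{1/2}$ and $\mathcal{O}^d_{\max}(\mathcal{U}_{n+1}) \geq \mathcal{O}^d_{\max}(\mathcal{U}_n)$. Hence
$\mathcal{O}^d_{\max}(\mathcal{U}_n) \to Q_{\max}(d)$ a.s., as $n \to \infty$, for some $Q_{\max}(d)$.
Then by the coupling of $\Pi_n$ and $\mathcal{U}_n$ and the fact that $N(n) \to \infty$ a.s., we have that with this
coupling $\mathcal{O}^d_{\max}(\Pi_n)$ converges to the same $Q_{\max}(d)$ a.s. and hence in distribution (regardless of
the coupling), completing the proof of part (i).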



We now prove part (ii) of the theorem, and so take $d = 1$. First we prove (10). Again
by the coupling of $\Pi_n$ and $\mathcal{U}_n$, it suffices to prove that $\mathcal{O}^1_{\max}(\mathcal{U}_n^0) \to M$ a.s. as $n \to \infty$. The
following argument is related to the proof of Theorem 2 of [22].

An upper record value in the sequence $X_1, X_2, X_3, \ldots$ is a value $X_i$ which exceeds
$\max\{X_1, \ldots, X_{i-1}\}$ (the first value $X_1$ is also included as a record value). Let
$j(1), j(2), \ldots$ be the values of $i \in \mathbb{N}$ such that $U_i$ is an upper record in the sequence
$(U_1, U_2, \ldots)$, arranged in increasing order so that $1 = j(1) < j(2) < \cdots$. Let
$R_n := \max\{k : j(k) \leq n\}$ be the number of record values in the sequence $\mathcal{U}_n = (U_1, \ldots, U_n)$.

A record $U_i$ has by definition no preceding point in the sequence $\mathcal{U}_n$ to its right in the
unit interval, and hence (in the ONG on $\mathcal{U}_n^0$) must be joined to its nearest neighbour to
the left amongst those points already present, which is necessarily the previous record value
when $i > 1$, or $0$ in the case of $U_1$. Then each non-record $U_i$ lies in an interval between a
record value and its nearest neighbour to the left, and hence gives rise to a shorter edge than
that from some record value. Thus

$$\mathcal{O}^1_{\max}(\mathcal{U}_n^0) = \max_{1 \leq i \leq R_n} \left\{ U_{j(i)} - U_{j(i-1)} \right\}, \tag{11}$$

where we set $j(0) := 0$ and $U_0 := 0$. For $i \in \mathbb{N}$ set

$$V_i := \frac{1 - U_{j(i)}}{1 - U_{j(i-1)}}.$$

It is not hard to see that $V_1, V_2, \ldots$ are mutually independent and each is uniformly
distributed over $(0,1)$. Therefore, setting

$$M := \max\{1 - V_1, V_1(1 - V_2), V_1 V_2 (1 - V_3), V_1 V_2 V_3 (1 - V_4), \ldots\},$$

we obtain

$$M = \max\{1 - V_1, V_1 \max\{1 - V_2, V_2(1 - V_3), V_2 V_3 (1 - V_4), \ldots\}\} = \max\{1 - V_1, V_1 M'\}, \tag{12}$$

where $M' := \max\{1 - V_2, V_2(1 - V_3), V_2 V_3 (1 - V_4), \ldots\}$ has the same distribution as $M$
and is independent of $V_1$. Hence $M$ has the max-Dickman distribution as given by (4).
Furthermore, with the convention that an empty product is 1,

$$(1 - V_i) \prod_{k=1}^{i-1} V_k = \frac{U_{j(i)} - U_{j(i-1)}}{1 - U_{j(i-1)}} \prod_{k=1}^{i-1} \left( \frac{1 - U_{j(k)}}{1 - U_{j(k-1)}} \right) = U_{j(i)} - U_{j(i-1)}, \tag{13}$$

for $i \in \mathbb{N}$. Also, $R_n \to \infty$ almost surely as $n \to \infty$. Hence by (11), (12) and (13),

$$\mathcal{O}^1_{\max}(\mathcal{U}_n^0) = \max_{1 \leq i \leq R_n} \left\{ (1 - V_i) \prod_{k=1}^{i-1} V_k \right\} \to \max_{i \geq 1} \left\{ (1 - V_i) \prod_{k=1}^{i-1} V_k \right\} = M,$$

where the convergence is almost sure. This proves (10).



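To complete the proof of part (ii) of the theorem, we need to prove (9). Conditioning on
$U = U_1$ and the number of points of $(U_2, U_3, \ldots, U_n)$ that fall in each of the two intervals
$(0, U)$, $(U, 1)$, we obtain by scaling that

$$\mathcal{O}^1_{\max}(\mathcal{U}_n) \stackrel{d}{=} \max\{U \mathcal{O}^1_{\max}(\mathcal{U}_L^0), (1-U) \mathcal{O}^1_{\max}(\tilde{\mathcal{U}}_{n-1-L}^0)\}, \tag{14}$$

where in the right-hand expression $\tilde{\mathcal{U}}_m^0 = (0, \tilde{U}_1, \tilde{U}_2, \ldots, \tilde{U}_m)$, $L \sim \mathrm{Bin}(n-1, U)$, and
$U, U_1, U_2, \ldots, \tilde{U}_1, \tilde{U}_2, \ldots$ are independent uniform random variables on $(0,1)$. Here $L$ and
$n - 1 - L$ both tend to infinity a.s. as $n \to \infty$, and $\mathcal{O}^1_{\max}(\mathcal{U}_L^0)$ and
$\mathcal{O}^1_{\max}(\tilde{\mathcal{U}}_{n-1-L}^0)$ are independent given $L$. Thus by (10) we have that
$\mathcal{O}^1_{\max}(\mathcal{U}_L^0)$ and $\mathcal{O}^1_{\max}(\tilde{\mathcal{U}}_{n-1-L}^0)$ converge in
distribution to independent copies of the max-Dickman variable $M$. Then (14) and the fact
that $Q_{\max}(1)$ is the distributional limit of $\mathcal{O}^1_{\max}(\mathcal{U}_n)$ yields (9). □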
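
The record-value identity (11) used in the proof can be checked numerically. The sketch below (hypothetical function names, standard library only) computes the longest ONG edge on $(0, U_1, \ldots, U_n)$ by brute force and compares it with the largest gap between successive upper records, taking $0$ as the initial reference value.

```python
import random


def ong_longest_edge_1d(seq):
    """Longest edge of the one-dimensional ONG on the sequence `seq` (length at least 2)."""
    return max(
        min(abs(seq[i] - seq[k]) for k in range(i))  # on-line nearest-neighbour distance
        for i in range(1, len(seq))
    )


def longest_record_gap(us):
    """Largest gap between successive upper records of `us`, starting from 0."""
    best, prev = 0.0, 0.0
    for u in us:
        if u > prev:  # u is an upper record
            best = max(best, u - prev)
            prev = u
    return best


if __name__ == "__main__":
    rng = random.Random(11)
    us = [rng.random() for _ in range(1000)]
    # Both quantities agree, as the identity (11) asserts.
    print(ong_longest_edge_1d([0.0] + us), longest_record_gap(us))
```

The second function runs in linear time, which reflects the point of the argument: only the records matter for the longest edge once an initial point sits at the origin.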



4 Limit theorems away from the boundary 

In this section we prove a law of large numbers and central limit theorem for the total
power-weighted length of the MDST edges from points that are not too close to the base of the
unit $d$-cube. To do this, we employ some general results of Penrose [19-21].

Recently, notions of stabilizing functionals of point sets have proved to be a useful basis
for a general methodology for establishing limit theorems for functionals of random point
sets in $\mathbb{R}^d$. See for example [18,20,21,26,27]. To prove the law of large numbers (Lemma 4.1)
and central limit theorem (Lemma 4.4) in this section, we make use of the general results
on convergence of random measures in geometric probability given in [19-21]. These two
lemmas will then form two of the ingredients for two of our main results, Theorems 2.1 and
2.2.

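We use the following notation. Let $d \in \mathbb{N}$. Let $\mathcal{X} \subset \mathbb{R}^d$ be finite. For constant $a > 0$, and
$\mathbf{y} \in \mathbb{R}^d$, let $\mathbf{y} + a\mathcal{X}$ denote the transformed set $\{\mathbf{y} + a\mathbf{x} : \mathbf{x} \in \mathcal{X}\}$. For
$\mathbf{x} \in \mathbb{R}^d$ and $r > 0$, let $B(\mathbf{x}; r)$ be the closed Euclidean $d$-ball with centre $\mathbf{x}$ and radius $r$.
For bounded measurable $R \subset \mathbb{R}^d$ let $|R|$ denote the $d$-dimensional Lebesgue measure of $R$. Write
$\mathbf{0}$ for the origin of $\mathbb{R}^d$.

For $\alpha > 0$, define the $[0, \infty)$-valued function on finite non-empty $\mathcal{X} \subset \mathbb{R}^d$ and $\mathbf{x} \in \mathcal{X}$:

$$\xi(\mathbf{x}; \mathcal{X}) := d_*(\mathbf{x}; \mathcal{X})^{\alpha}, \tag{15}$$

and set $\xi(\mathbf{x}; \emptyset) := 0$ for any $\mathbf{x}$. Then $\xi$ is translation invariant (that is,
$\xi(\mathbf{y} + \mathbf{x}; \mathbf{y} + \mathcal{X}) = \xi(\mathbf{x}; \mathcal{X})$ for all $\mathbf{y} \in \mathbb{R}^d$, all finite $\mathcal{X} \subset \mathbb{R}^d$ and
$\mathbf{x} \in \mathcal{X}$) and homogeneous of order $\alpha$ (that is, for any $r > 0$,
$\xi(r\mathbf{x}; r\mathcal{X}) = r^{\alpha} \xi(\mathbf{x}; \mathcal{X})$ for all finite $\mathcal{X} \subset \mathbb{R}^d$ and $\mathbf{x} \in \mathcal{X}$). For
$\mathcal{X} \subset \mathbb{R}^d$ and $\mathbf{x} \in \mathbb{R}^d$, write $\mathcal{X}^{\mathbf{x}}$ for $\mathcal{X} \cup \{\mathbf{x}\}$. If $\mathbf{x} \notin \mathcal{X}$, we
abbreviate notation to $\xi(\mathbf{x}; \mathcal{X}) = \xi(\mathbf{x}; \mathcal{X}^{\mathbf{x}})$. The
above definitions extend naturally to infinite but locally finite sets $\mathcal{X}$ (as in [20]).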
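
Translation invariance and homogeneity of order $\alpha$ can be checked numerically on a small sample. In the following Python sketch the helpers `xi`, `shift` and `scale` are hypothetical names introduced for illustration; `xi` returns the $\alpha$-th power of the distance from a point to its nearest neighbour among points with strictly smaller last coordinate, and $0$ at the sink.

```python
import math
import random


def xi(x, points, alpha):
    """(Directed nearest-neighbour distance of x in `points`)**alpha, or 0.0 at the sink."""
    below = [y for y in points if y[-1] < x[-1]]  # candidates lying strictly below x
    if not below:
        return 0.0
    return min(math.dist(x, y) for y in below) ** alpha


def shift(y, x):
    """The translate y + x."""
    return tuple(a + b for a, b in zip(y, x))


def scale(r, x):
    """The dilation r * x."""
    return tuple(r * a for a in x)


if __name__ == "__main__":
    rng = random.Random(5)
    pts = [(rng.random(), rng.random()) for _ in range(25)]
    x, y, r, alpha = pts[0], (0.25, -0.5), 2.0, 1.5
    print(xi(x, pts, alpha))
    print(xi(shift(y, x), [shift(y, p) for p in pts], alpha))   # same value, up to rounding
    print(xi(scale(r, x), [scale(r, p) for p in pts], alpha))   # r**alpha times the first value
```

Both properties hold because translations and positive dilations preserve the ordering of last coordinates, so the set of candidate neighbours is unchanged.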
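Let

\[ \mathcal{L}^{d,\alpha}(\mathcal{X}; R) := \sum_{x \in \mathcal{X} \cap R} \xi(x; \mathcal{X}) \qquad (16) \]

be the translation invariant functional defined on all finite point sets $\mathcal{X} \subset \mathbb{R}^d$ and all Borel
sets $R \subset \mathbb{R}^d$ induced by the function $\xi$. Then $\mathcal{L}^{d,\alpha}(\mathcal{X}; R)$ is the total power-weighted length
of the edges of the MDST on $\mathcal{X}$ originating from points in the region $R$. It is this functional
that interests us here. When $\mathcal{X}$ is random, set $\tilde{\mathcal{L}}^{d,\alpha}(\mathcal{X}; R) := \mathcal{L}^{d,\alpha}(\mathcal{X}; R) - E[\mathcal{L}^{d,\alpha}(\mathcal{X}; R)]$.

Note that with our previous notation, $\mathcal{L}^{d,\alpha}(\mathcal{X}) = \mathcal{L}^{d,\alpha}(\mathcal{X}; (0,1)^d)$ for $\mathcal{X} \subset (0,1)^d$.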
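
As an illustration only (it is not part of the original argument), the functional in (16) can be computed by brute force for a small point set. The sketch below assumes the reading of the partial order suggested by the surrounding text, namely that a point may only be joined to a point that is no higher in the last coordinate; the helper names `d_star` and `mdst_weight` are hypothetical.

```python
import math

def d_star(i, pts):
    """Distance from pts[i] to its nearest other point that is no higher in the
    last coordinate (assumed reading of the order); 0.0 if no such point exists."""
    x = pts[i]
    cands = [math.dist(x, y) for j, y in enumerate(pts)
             if j != i and y[-1] <= x[-1]]
    return min(cands) if cands else 0.0

def mdst_weight(pts, alpha, in_region=lambda x: True):
    """Sum of d_star**alpha over the points lying in the region, as in (16)."""
    return sum(d_star(i, pts) ** alpha
               for i, x in enumerate(pts) if in_region(x))

pts = [(0.5, 0.1), (0.2, 0.5), (0.9, 0.6)]
total = mdst_weight(pts, alpha=2.0)
# Restrict to points whose last coordinate exceeds 0.55 (a toy region R).
upper = mdst_weight(pts, alpha=2.0, in_region=lambda x: x[-1] > 0.55)
# Homogeneity of order alpha: scaling every point by 3 multiplies the weight by 3**2.
scaled = mdst_weight([(3 * a, 3 * b) for a, b in pts], alpha=2.0)
```

The quadratic-time search is chosen for transparency rather than efficiency.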

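Fix $\varepsilon \in (0, 1/d)$ (small). Let $(g_n)_{n>0}$ be such that $g_n \in (0,1)$ and $g_n = \Theta(n^{\varepsilon-(1/d)})$ as
$n \to \infty$, where by $a(n) = \Theta(b(n))$ as $n \to \infty$ we mean

\[ 0 < \liminf_{n\to\infty} \frac{a(n)}{b(n)} \le \limsup_{n\to\infty} \frac{a(n)}{b(n)} < \infty. \]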

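Given $g_n$, we introduce the family $(\Gamma_n)_{n\ge1}$ of Borel subsets of $\mathbb{R}^d$ given by

\[ \Gamma_n := (0,1)^{d-1} \times (g_n, 1), \qquad (17) \]

i.e. $\Gamma_n$ is the unit $d$-cube without a thin strip at the base (in the $e_d$ sense). Note that the
limiting set $\cup_{n\ge1}\Gamma_n = (0,1)^d$. Later on, in Section 7, we will make a more specific choice
for $g_n$. For $n \ge 1$, locally finite $\mathcal{X} \subset \mathbb{R}^d$ and $x \in \mathcal{X}$ we define the scaled-up version of $\xi$
restricted to $\Gamma_n$ by

\[ \xi_n(x; \mathcal{X}) := \xi(n^{1/d}x; n^{1/d}\mathcal{X}) \mathbf{1}_{\Gamma_n}(x). \]

Then, from (16),

\[ \mathcal{L}^{d,\alpha}(\mathcal{X}; \Gamma_n) = \sum_{x\in\mathcal{X}} \xi(x; \mathcal{X}) \mathbf{1}_{\Gamma_n}(x) = n^{-\alpha/d} \sum_{x\in\mathcal{X}} \xi_n(x; \mathcal{X}), \qquad (18) \]

using the fact that $\xi$ as given by (15) is homogeneous of order $\alpha$. We employ the following
notion of stabilization (see [20,21]).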

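Definition 4.1 For any locally finite $\mathcal{X} \subset \mathbb{R}^d$ and Borel region $A \subset \mathbb{R}^d$, define $R_\xi(\mathbf{0}; \mathcal{X}, A)$
(called the radius of stabilization for $\xi$ at $\mathbf{0}$ with respect to $\mathcal{X}$ and $A$) to be the smallest
integer $r \ge 0$ such that

\[ \xi(\mathbf{0}; (\mathcal{X} \cap B(\mathbf{0}; r)) \cup \mathcal{Y}) = \xi(\mathbf{0}; \mathcal{X} \cap B(\mathbf{0}; r)), \]

for all finite $\mathcal{Y} \subset A \setminus B(\mathbf{0}; r)$. If no such $r$ exists, set $R_\xi(\mathbf{0}; \mathcal{X}, A) = \infty$.
When $A$ is all of $\mathbb{R}^d$, we write $R_\xi(\mathbf{0}; \mathcal{X})$ for $R_\xi(\mathbf{0}; \mathcal{X}, \mathbb{R}^d)$.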



4.1 Law of large numbers 

We will apply a Poisson point process analogue of the law of large numbers Theorem 2.1 
of [20]. As mentioned on p. 1130 of [20], such a Poisson-sample result follows by similar 
arguments to the proofs in [20]; in fact such a result is stated and proved as Theorem 2.1 
in [19]. It is this latter result that we will use in this section. 

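Let $\mathcal{H}_1$ denote a homogeneous Poisson point process of unit intensity on $\mathbb{R}^d$. Our law of
large numbers result for this section is the following.

Lemma 4.1 Suppose $d \in \{2, 3, \ldots\}$ and $\alpha > 0$. As $n \to \infty$ we have

\[ n^{(\alpha/d)-1} \mathcal{L}^{d,\alpha}(\mathcal{P}_n; \Gamma_n) \longrightarrow E[\xi(\mathbf{0}; \mathcal{H}_1)] = 2^{\alpha/d} v_d^{-\alpha/d} \Gamma(1 + (\alpha/d)), \qquad (19) \]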

where the convergence is in $L^2$, and $v_d$ is given by (1).
The statement (19) will follow from Theorem 2.1 of [19] applied to our functional $\xi$ as
defined at (15), using (18). Thus we need to verify the conditions of Theorem 2.1 of [19]:
(a) that $R_\xi(\mathbf{0}; \mathcal{H}_1)$ is almost surely finite; and (b) that there exists some $p > 2$ such that the
following two moments conditions hold:

\[ \sup_{n\ge1;\ x\in(0,1)^d} E[\xi_n(x; \mathcal{P}_n)^p] < \infty, \qquad (20) \]

and

\[ \sup_{n\ge1;\ x,y\in(0,1)^d} E[\xi_n(x; \mathcal{P}_n^y)^p] < \infty. \qquad (21) \]


The next two lemmas take care of this. 



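Lemma 4.2 For $\xi$ given by (15), the radius of stabilization $R_\xi(\mathbf{0}; \mathcal{H}_1)$ as defined in Definition 4.1
is almost surely finite.

Proof. Let $R = d_*(\mathbf{0}; \mathcal{H}_1)$. Then $R$ is finite almost surely. For any $\ell > R$ we have that
$\xi(\mathbf{0}; (\mathcal{H}_1 \cap B(\mathbf{0}; \ell)) \cup \mathcal{Y}) = R^\alpha$, for any finite $\mathcal{Y} \subset \mathbb{R}^d \setminus B(\mathbf{0}; \ell)$. Thus taking $R_\xi(\mathbf{0}; \mathcal{H}_1)$ to be
the smallest integer greater than $R$, $R_\xi(\mathbf{0}; \mathcal{H}_1)$ is almost surely finite. □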
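
The stabilization property used in this proof can be checked on a finite sample. The following Python sketch is an illustration under stated assumptions: a fixed number of uniform points in a square stands in for the Poisson process, the order is read as "no higher in the last coordinate", and the names `xi_at_origin`, `ell` and `far` are hypothetical.

```python
import math
import random

ORIGIN = (0.0, 0.0)

def xi_at_origin(points, alpha):
    """xi(0; X): alpha-th power of the distance from the origin to the nearest
    point with non-positive last coordinate (0.0 if there is none)."""
    cands = [math.dist(ORIGIN, y) for y in points if y[-1] <= 0.0]
    return min(cands) ** alpha if cands else 0.0

rng = random.Random(1)
alpha = 1.5
# 100 uniform points in [-5, 5]^2: a stand-in for a unit-intensity Poisson sample.
X = [(rng.uniform(-5, 5), rng.uniform(-5, 5)) for _ in range(100)]
R = xi_at_origin(X, 1.0)          # R = d_*(0; X)
ell = math.floor(R) + 1           # smallest integer greater than R
inside = [y for y in X if math.dist(ORIGIN, y) <= ell]
# Arbitrary extra points outside the ball B(0; ell).
far = [p for p in ((rng.uniform(-10, 10), rng.uniform(-10, 10)) for _ in range(200))
       if math.dist(ORIGIN, p) > ell]
value_inside = xi_at_origin(inside, alpha)
value_with_far = xi_at_origin(inside + far, alpha)
```

Adding points beyond distance `ell` cannot change the minimum, so the two values coincide exactly.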



Lemma 4.3 Suppose $d \in \{2, 3, \ldots\}$ and $\alpha > 0$. Then for $(\Gamma_n)_{n\ge1}$ as given at (17) and $\xi$ as
given by (15) the moments conditions (20) and (21) hold for any $p > 0$.

Proof. We have from the definition of $\xi_n$ and (15) that

\[ \sup_{x\in(0,1)^d} E[\xi_n(x; \mathcal{P}_n)^p] = \sup_{x\in\Gamma_n} E[\xi(n^{1/d}x; n^{1/d}\mathcal{P}_n)^p] = \sup_{x\in\Gamma_n} E[d_*(n^{1/d}x; n^{1/d}\mathcal{P}_n)^{\alpha p}]. \qquad (22) \]

For $d \in \{2, 3, \ldots\}$, $x \in \Gamma_n$ and $r > 0$, define the region in the scaled-up space

\[ A_n(x, r) := B(n^{1/d}x; r) \cap (0, n^{1/d})^d \cap \{y \in \mathbb{R}^d : y \preccurlyeq_* n^{1/d}x\}. \qquad (23) \]



For $x \in \Gamma_n$, define the variables $\zeta_n^{(1)}(x) := d_*(n^{1/d}x; n^{1/d}\mathcal{P}_n) \mathbf{1}\{d_*(x; \mathcal{P}_n) \le g_n\}$ and $\zeta_n^{(2)}(x) :=
d_*(n^{1/d}x; n^{1/d}\mathcal{P}_n) \mathbf{1}\{d_*(x; \mathcal{P}_n) > g_n\}$. For $t > 0$,

\[ P(\zeta_n^{(1)}(x) > t) = P(\{d_*(n^{1/d}x; n^{1/d}\mathcal{P}_n) > t\} \cap \{d_*(n^{1/d}x; n^{1/d}\mathcal{P}_n) \le n^{1/d}g_n\}). \]

This probability is clearly zero unless $t < n^{1/d}g_n$, in which case, by the definition of $\Gamma_n$, the
region $A_n(x, t)$ does not touch the hyperplane $\{x_d = 0\}$, so that $|A_n(x, t)| \ge 2^{-d}v_d t^d$, where
$v_d$ is the volume of the unit $d$-ball given by (1). Hence for all $t > 0$,

\[ P(\zeta_n^{(1)}(x) > t) \le \exp(-2^{-d}v_d t^d), \]

so that for all $n$ and all $x \in \Gamma_n$, $\zeta_n^{(1)}(x)$ is stochastically dominated by a variable with
cumulative distribution function $F(t) = 1 - \exp(-2^{-d}v_d t^d)$, $t \ge 0$. Such a variable has finite
$(\alpha p)$-th moment.

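Also, for all $n$ and all $x \in \Gamma_n$, the random variable $\zeta_n^{(2)}(x)$ is bounded by the random
variable $d^{1/2}n^{1/d} \mathbf{1}\{d_*(n^{1/d}x; n^{1/d}\mathcal{P}_n) > n^{1/d}g_n\}$, so that

\[ E[(\zeta_n^{(2)}(x))^{\alpha p}] \le d^{\alpha p/2} n^{\alpha p/d} P(d_*(n^{1/d}x; n^{1/d}\mathcal{P}_n) > n^{1/d}g_n), \]

and since $n^{1/d}g_n = \Theta(n^\varepsilon)$, this upper bound is bounded in $n$: the probability is at most
$\exp(-2^{-d}v_d (n^{1/d}g_n)^d)$, which decays faster than any power of $n$. Thus the $(\alpha p)$-th moment of
$\zeta_n^{(2)}(x)$ is bounded uniformly over all $n$ and all $x \in \Gamma_n$. Combined with the earlier uniform
moment bound for $\zeta_n^{(1)}(x)$ and (22), this yields (20).
For (21), note that for any $x \in \Gamma_n$, $y \in (0,1)^d$,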



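\[ \xi_n(x; \mathcal{P}_n^y) = d_*(n^{1/d}x; n^{1/d}(\mathcal{P}_n \cup \{y\}))^\alpha \le d_*(n^{1/d}x; n^{1/d}\mathcal{P}_n)^\alpha + \mathbf{1}\{\mathcal{P}_n \subset \Gamma_n\} n^{\alpha/d} d^{\alpha/2}. \]

Moreover, $\xi_n(x; \mathcal{P}_n^y)$ is zero for $x \in (0,1)^d \setminus \Gamma_n$. Thus

\[ \sup_{x,y\in(0,1)^d} E[\xi_n(x; \mathcal{P}_n^y)^p] \le \sup_{x\in(0,1)^d} E[\xi_n(x; \mathcal{P}_n)^p] + P(\mathcal{P}_n \subset \Gamma_n) n^{\alpha p/d} d^{\alpha p/2}, \]

so that (20) implies (21) since $P(\mathcal{P}_n \subset \Gamma_n) = \exp(-ng_n)$. □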



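Proof of Lemma 4.1. From Theorem 2.1 of [19], with (18) and Lemmas 4.2 and 4.3, we
obtain the convergence statement in (19). It remains to prove the final equality in (19). We
have, for $s > 0$,

\[ P(\xi(\mathbf{0}; \mathcal{H}_1) > s) = P(\mathcal{H}_1 \cap \{x \in \mathbb{R}^d : x \preccurlyeq_* \mathbf{0}\} \cap B(\mathbf{0}; s^{1/\alpha}) = \emptyset) = \exp(-(v_d/2)s^{d/\alpha}). \]

Hence,

\[ E[\xi(\mathbf{0}; \mathcal{H}_1)] = \int_0^\infty P(\xi(\mathbf{0}; \mathcal{H}_1) > s)\, ds = \int_0^\infty \exp(-(v_d/2)s^{d/\alpha})\, ds, \]

which by the change of variables $y = (v_d/2)s^{d/\alpha}$ is the same as

\[ \frac{\alpha}{d} 2^{\alpha/d} v_d^{-\alpha/d} \int_0^\infty y^{(\alpha/d)-1} \exp(-y)\, dy = \frac{\alpha}{d} 2^{\alpha/d} v_d^{-\alpha/d} \Gamma(\alpha/d), \]

by Euler's Gamma integral (see e.g. 6.1.1 in [1]). The desired equality now follows from the
functional relation $x\Gamma(x) = \Gamma(1 + x)$ (see 6.1.15 in [1]). □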
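
As a numerical sanity check of this computation (an illustration, not part of the proof), the tail integral can be compared with the closed form. The sketch assumes the standard formula $v_d = \pi^{d/2}/\Gamma(1 + d/2)$ for the volume of the unit $d$-ball; the function names and the truncation point are choices made here.

```python
import math

def unit_ball_volume(d):
    # Standard formula for the volume of the unit d-ball.
    return math.pi ** (d / 2) / math.gamma(1 + d / 2)

def mean_xi_closed_form(d, alpha):
    # 2^{alpha/d} v_d^{-alpha/d} Gamma(1 + alpha/d)
    v = unit_ball_volume(d)
    return 2 ** (alpha / d) * v ** (-alpha / d) * math.gamma(1 + alpha / d)

def mean_xi_numeric(d, alpha, steps=200000):
    # Midpoint rule for the integral of exp(-(v_d/2) s^{d/alpha}) over (0, upper),
    # where upper is chosen so that the integrand there equals exp(-50).
    v = unit_ball_volume(d)
    upper = (100.0 / v) ** (alpha / d)
    h = upper / steps
    return h * sum(math.exp(-(v / 2) * ((k + 0.5) * h) ** (d / alpha))
                   for k in range(steps))

closed_2_1 = mean_xi_closed_form(2, 1.0)
numeric_2_1 = mean_xi_numeric(2, 1.0)
closed_3_2 = mean_xi_closed_form(3, 2.0)
numeric_3_2 = mean_xi_numeric(3, 2.0)
```

For $d = 2$, $\alpha = 1$ the closed form reduces to $2^{-1/2}$, which gives an independent check.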



4.2 Central limit theorem 



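We again consider $\mathcal{L}^{d,\alpha}(\mathcal{P}_n; \Gamma_n)$ as given by (18). In this section we aim to prove a central
limit theorem complementing the law of large numbers of Section 4.1. This time, we will
apply Theorems 2.1 and 2.2 of [21] to give the following result.

Lemma 4.4 Let $d \in \{2, 3, \ldots\}$ and $\alpha > 0$. There exists a constant $s_\alpha \in [0, \infty)$, not
depending on the choice of $\varepsilon$ or the sequence $g_n$, such that

\[ \lim_{n\to\infty} \left( n^{(2\alpha/d)-1} \mathrm{Var}[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; \Gamma_n)] \right) = \lim_{n\to\infty} \left( n^{-1} \mathrm{Var}\Big[ \sum_{x\in\mathcal{P}_n} \xi_n(x; \mathcal{P}_n) \Big] \right) = s_\alpha^2, \]

and, as $n \to \infty$,

\[ n^{(\alpha/d)-(1/2)} \tilde{\mathcal{L}}^{d,\alpha}(\mathcal{P}_n; \Gamma_n) \stackrel{d}{\longrightarrow} \mathcal{N}(0, s_\alpha^2). \]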
Proof. In order to prove this lemma, we need to verify the conditions of Theorems 2.1 and
2.2 of [21] (see also Theorem 2.2 and 2.3 of [19]) for our function $\xi$ as given by (15). In
addition to the moments conditions (20), (21) (as shown to hold in Lemma 4.3), we need to
demonstrate the following additional stabilization conditions:

\[ P(R_\xi(\mathbf{0}; \mathcal{H}_1 \cup \{z\}) < \infty) = 1 \quad \text{for all } z \in \mathbb{R}^d, \qquad (24) \]

and

\[ \limsup_{s\to\infty} s^{-1} \log \Big( \sup_{n\ge1;\ x\in\Gamma_n} P(R_\xi(n^{1/d}x; n^{1/d}\mathcal{P}_n, n^{1/d}(0,1)^d) > s) \Big) < 0. \qquad (25) \]

Condition (24) requires that the radius of stabilization is almost surely finite on the addition
of an arbitrary extra point to $\mathcal{H}_1$, and condition (25) requires exponential decay of the tail
of the radius of stabilization.



Given Lemma 4.2, (24) is clear, since with probability 1 the addition of any extra point
$z \in \mathbb{R}^d$ to $\mathcal{H}_1$ can only decrease the radius of stabilization at $\mathbf{0}$.

We need to prove (25). Let $A_n(x, r)$ be defined by (23), and for $z = (z_1, z_2, \ldots, z_d) \in
n^{1/d}\Gamma_n$, let $m(z) := z_d$, the last component of $z$. For $d \ge 2$, we claim that there are finite
constants $C_d > 0$ and $n_0 \ge 1$ such that

\[ |A_n(x, r)| \ge C_d r^{d-1} \quad \text{if } r \in (1, d^{1/2}n^{1/d}] \qquad (26) \]

for all $n \in \mathbb{N}$ with $n \ge n_0$, and any $x \in \Gamma_n$.

We verify the claim (26). Take $n_0$ such that for all $n \ge n_0$ we have $n^{1/d}g_n > 1$. Then
for $n \ge n_0$, suppose $r \in (1, d^{1/2}n^{1/d}]$. For a lower bound on the volume of $A_n(x, r)$, consider
$x = (0, 0, \ldots, 0, m(x))$, the 'worst case'. Let $h_x$ denote the hyperplane $\{y \in n^{1/d}\Gamma_n : m(y) =
m(n^{1/d}x)\}$. Let $r' := d^{-1/2}r$, so $r' \le n^{1/d}$. Then let $w_1, w_2, \ldots, w_{d-1}$ denote the $d-1$ points
of $h_x$ given by $(r', 0, 0, \ldots, 0, m(n^{1/d}x))$, $(0, r', 0, \ldots, 0, m(n^{1/d}x))$, $\ldots$, $(0, 0, \ldots, 0, r', m(n^{1/d}x))$, and let
$w_0$ denote the point $(0, 0, \ldots, 0, m(n^{1/d}x) - 1)$. Then since $x \in \Gamma_n$, the $d$-dimensional 'right
pyramid' defined by vertices $w_0, n^{1/d}x, w_1, \ldots, w_{d-1}$ is contained within both $(0, n^{1/d})^d$ and
the half-ball $B(n^{1/d}x; r) \cap \{y \in \mathbb{R}^d : y \preccurlyeq_* n^{1/d}x\}$. The volume of this 'pyramid' is $(r')^{d-1}/d!$.
This gives a lower bound for $|A_n(x, r)|$, and (26) holds as claimed.

To prove (25), note that $n^{1/d}\mathcal{P}_n$ is a homogeneous Poisson point process of unit intensity
on $(0, n^{1/d})^d$. Then for $s > 1$, arguing as in the proof of Lemma 4.2 yields

\[ P(R_\xi(n^{1/d}x; n^{1/d}\mathcal{P}_n, n^{1/d}(0,1)^d) > s) \le P(d_*(n^{1/d}x; n^{1/d}\mathcal{P}_n) > s - 1) \le \exp(-|A_n(x, s-1)|). \]

So by (26), for $n \ge n_0$ and $2 < s \le d^{1/2}n^{1/d} + 1$, we obtain

\[ \sup_{x\in\Gamma_n} P(R_\xi(n^{1/d}x; n^{1/d}\mathcal{P}_n, n^{1/d}(0,1)^d) > s) \le \exp(-C_d(s-1)^{d-1}). \]

Also, this probability is zero for $s > d^{1/2}n^{1/d} + 1$. Thus for any $s > d^{1/2}n_0^{1/d} + 1$,

\[ \sup_{n\ge1;\ x\in\Gamma_n} P(R_\xi(n^{1/d}x; n^{1/d}\mathcal{P}_n, n^{1/d}(0,1)^d) > s) \le \exp(-C_d(s-1)^{d-1}), \]

and (25) follows. □
5 Boundary effects in the MDST

In this section, we consider the contribution to the total power-weighted length of the MDST
under $\preccurlyeq_*$ due to boundary effects near the 'bottom face' of the $d$-cube. Here the possibility of
long edges leads to rather special behaviour. We shall see that the on-line nearest-neighbour
graph, as described in Section 3, will be a useful tool here.

Fix $\varepsilon > 0$ small. Let $(t_n)_{n>0}$ be such that $t_n \in (0,1)$ and $t_n = \Theta(n^{-(1/2)-\varepsilon})$ as $n \to
\infty$ (we make a specific choice for $t_n$ in Section 7). Let $B_n$ denote the boundary region
$(0,1)^{d-1} \times (0, t_n]$, i.e. we look in a thin slice at the base (in the sense of $e_d$) of the unit
$d$-cube. Recall from (16) that $\mathcal{L}^{d,\alpha}(\mathcal{X}; R)$ denotes the contribution to the total weight of the
MDST on $\mathcal{X}$ from those points of $\mathcal{X} \cap R$, and $\tilde{\mathcal{L}}^{d,\alpha}(\mathcal{X}; R) := \mathcal{L}^{d,\alpha}(\mathcal{X}; R) - E[\mathcal{L}^{d,\alpha}(\mathcal{X}; R)]$.
Also recall that $\mathcal{P}_n$ denotes a homogeneous Poisson point process of intensity $n$ on $(0,1)^d$.
Our main result of this section is the following.

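Theorem 5.1 Suppose $d \in \{2, 3, \ldots\}$. Let $\varepsilon > 0$ and $t_n = \Theta(n^{-(1/2)-\varepsilon})$ specify $B_n$.

(i) Suppose $\alpha \ge d/2$. With $Q(d-1, \alpha)$ as in Lemma 3.2, we have that as $n \to \infty$

\[ \tilde{\mathcal{L}}^{d,\alpha}(\mathcal{P}_n; B_n) \stackrel{d}{\longrightarrow} Q(d-1, \alpha). \qquad (27) \]

(ii) Suppose $\alpha \in (0, d/2)$. As $n \to \infty$,

\[ n^{(\alpha/d)-(1/2)} \tilde{\mathcal{L}}^{d,\alpha}(\mathcal{P}_n; B_n) \stackrel{P}{\longrightarrow} 0. \qquad (28) \]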

The idea behind the proof of Theorem 5.1 is to show that the MDST under $\preccurlyeq_*$ near to the
boundary is close to an ONG defined on a sequence of uniform random vectors in $(0,1)^{d-1}$
coupled to the points of the MDST in $B_n$. To do this, we produce an explicit sequence of
random variables on which we construct the ONG, coupled to $\mathcal{P}_n$ on which the MDST is
constructed.

Define the point process

\[ \mathcal{W}_n := \mathcal{P}_n \cap B_n. \qquad (29) \]

Let $\beta_n := \mathrm{card}(\mathcal{W}_n)$. List $\mathcal{W}_n$ in order of increasing $x_d$-coordinate as $U_i$, $i = 1, 2, \ldots, \beta_n$. In
coordinates, set $U_i = (U_i^1, U_i^2, \ldots, U_i^d)$ for each $i$. Let $V_i = (U_i^1, \ldots, U_i^{d-1}) \in (0,1)^{d-1}$ be
the projection of $U_i$ down (in the $e_d$ sense) onto the base of the unit $d$-cube. Set

\[ \mathcal{V}_n := (V_1, \ldots, V_{\beta_n}). \qquad (30) \]

Then $\mathcal{V}_n$ is a sequence of uniform random vectors in $(0,1)^{d-1}$ (the base of the unit $d$-cube),
on which we may construct the ONG as appropriate. Note that the points of $\mathcal{V}_n$ in fact
constitute a homogeneous Poisson point process of intensity $nt_n = \Theta(n^{(1/2)-\varepsilon})$ on $(0,1)^{d-1}$
(this follows from the Mapping Theorem, see [15]). With the ONG weight functional $\mathcal{O}^{d,\alpha}(\cdot)$
defined in Section 3, the ONG weight $\mathcal{O}^{d-1,\alpha}(\mathcal{V}_n)$ is coupled in a natural way to $\mathcal{L}^{d,\alpha}(\mathcal{W}_n) =
\mathcal{L}^{d,\alpha}(\mathcal{P}_n; B_n)$.



Our first step towards Theorem 5.1 is the following result, which shows that, near the
boundary, the MDST is close to the coupled ONG.
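Lemma 5.1 Suppose $d \in \{2, 3, \ldots\}$. Let $\varepsilon > 0$ and $t_n = \Theta(n^{-(1/2)-\varepsilon})$ specify $B_n$. Let
$\mathcal{W}_n$, $\mathcal{V}_n$ be as defined at (29), (30) respectively. For $\alpha \ge 1$, as $n \to \infty$,

\[ \mathcal{L}^{d,\alpha}(\mathcal{W}_n) - \mathcal{O}^{d-1,\alpha}(\mathcal{V}_n) \longrightarrow 0, \quad \text{in } L^1, \qquad (31) \]

and, for $\alpha \in (0,1)$, as $n \to \infty$,

\[ E|\mathcal{L}^{d,\alpha}(\mathcal{W}_n) - \mathcal{O}^{d-1,\alpha}(\mathcal{V}_n)| = O(n^{(1/2)-\varepsilon-\alpha((1/2)+\varepsilon)}). \qquad (32) \]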

Proof. We construct the MDST on the set of points $\mathcal{W}_n$, and construct the ONG on the
projections down onto $(0,1)^{d-1}$, $\mathcal{V}_n$. With a slight abuse of notation, consider the points
$U_i = (V_i, U_i^d)$, $i = 1, \ldots, \beta_n$.

Note that by construction of the MDST on $\mathcal{W}_n$ and the ONG, and our choice of ordering of
points, we have that $U_j \preccurlyeq_* U_i$ if and only if $j \le i$. Thus either an edge exists from $U_i$ in the
MDST and also from $V_i$ in the ONG, or from neither. For the difference between the total
weights of the two models, it suffices to consider the case in which both edges exist. Then $V_i$
is joined to a point $V_{D(i)}$, $D(i) < i$, in the ONG, and $U_i$ to a point $U_{J(i)}$ in the MDST; we
do not necessarily have $J(i) = D(i)$. Since $J(i) < i$ by construction of the MDST on $\mathcal{W}_n$ and
the ordering of our points, we have that $V_{J(i)}$ was an admissible candidate to be the point
that $V_i$ joins to in the ONG. Therefore, we have that $\|V_i - V_{D(i)}\|_{d-1} \le \|V_i - V_{J(i)}\|_{d-1}$.
It then follows that

\[ \|(V_i, U_i^d) - (V_{J(i)}, U_{J(i)}^d)\|_d^\alpha \ge \|V_i - V_{J(i)}\|_{d-1}^\alpha \ge \|V_i - V_{D(i)}\|_{d-1}^\alpha, \qquad (33) \]

and so we have that, for all $\alpha > 0$,

\[ \mathcal{O}^{d-1,\alpha}(\mathcal{V}_n) \le \mathcal{L}^{d,\alpha}(\mathcal{W}_n). \qquad (34) \]

Also, by construction of the ONG and our ordering on points, we see $(V_{D(i)}, U_{D(i)}^d) \preccurlyeq_*
(V_i, U_i^d)$. So by the construction of the MDST, we have that

\[ \|(V_i, U_i^d) - (V_{J(i)}, U_{J(i)}^d)\|_d \le \|(V_i, U_i^d) - (V_{D(i)}, U_{D(i)}^d)\|_d. \qquad (35) \]

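If $(x, y) \in (0,1)^{d-1} \times (0,1)$ then $\|(x, y)\|_d \le \|x\|_{d-1} + y$, and by the Mean Value Theorem
for the function $t \mapsto t^\alpha$, for $\alpha \ge 1$,

\[ \|(x, y)\|_d^\alpha - \|x\|_{d-1}^\alpha \le (\|x\|_{d-1} + y)^\alpha - \|x\|_{d-1}^\alpha \le \alpha((d-1)^{1/2} + 1)^{\alpha-1} y. \]

So we have that, for $d \ge 2$ and $\alpha \ge 1$, there is a finite positive $C$ such that, a.s.,

\[ \|(V_i, U_i^d) - (V_{D(i)}, U_{D(i)}^d)\|_d^\alpha - \|V_i - V_{D(i)}\|_{d-1}^\alpha \le C(U_i^d - U_{D(i)}^d). \qquad (36) \]

Then (35) and (36) yield, for $\alpha \ge 1$, a.s.,

\[ \|(V_i, U_i^d) - (V_{J(i)}, U_{J(i)}^d)\|_d^\alpha - \|V_i - V_{D(i)}\|_{d-1}^\alpha \le C(U_i^d - U_{D(i)}^d) \le Ct_n, \qquad (37) \]

which implies that there exist $C, C' \in (0, \infty)$ such that for all $n \ge 1$

\[ \mathcal{L}^{d,\alpha}(\mathcal{W}_n) - \mathcal{O}^{d-1,\alpha}(\mathcal{V}_n) \le C\beta_n t_n \le C'\beta_n n^{-(1/2)-\varepsilon}. \qquad (38) \]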
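Combining (34) and (38) we have that, for $\alpha \ge 1$, some $C \in (0, \infty)$ and all $n \ge 1$, a.s.,

\[ 0 \le \mathcal{L}^{d,\alpha}(\mathcal{W}_n) - \mathcal{O}^{d-1,\alpha}(\mathcal{V}_n) \le C\beta_n n^{-(1/2)-\varepsilon}. \]

Taking expectations, using the facts that $\beta_n$ is Poisson with mean $nt_n = \Theta(n^{(1/2)-\varepsilon})$ and
$\varepsilon > 0$, we obtain (31).

Now we consider the case $\alpha \in (0,1)$. By the concavity of the function $t \mapsto t^\alpha$ for
$\alpha \in (0,1)$, we have for $(x, y) \in (0,1)^{d-1} \times (0,1)$ that

\[ \|(x, y)\|_d^\alpha - \|x\|_{d-1}^\alpha \le (\|x\|_{d-1} + y)^\alpha - \|x\|_{d-1}^\alpha \le y^\alpha \quad (0 < \alpha < 1). \]

Then, by a similar argument to the $\alpha \ge 1$ case, we obtain

\[ 0 \le \mathcal{L}^{d,\alpha}(\mathcal{W}_n) - \mathcal{O}^{d-1,\alpha}(\mathcal{V}_n) \le \beta_n t_n^\alpha \le C\beta_n n^{-\alpha((1/2)+\varepsilon)}, \]

so taking expectations yields (32). □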
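
The coupling in this proof is easy to reproduce numerically. The sketch below is an illustration under stated assumptions: a fixed number of uniform points in a thin slab replaces the Poisson sample, each point is joined to its nearest predecessor in the ordering by last coordinate, and the function name `mdst_and_ong_weights` is hypothetical. It checks the sandwich given by (34) together with the per-edge bounds for $\alpha = 1/2$, $1$ and $2$.

```python
import math
import random

def mdst_and_ong_weights(points, alpha):
    """Return (L, O): the MDST weight of the points and the ONG weight of their
    projections, with both graphs built in order of increasing last coordinate."""
    U = sorted(points, key=lambda p: p[-1])
    V = [p[:-1] for p in U]                     # projections onto the base
    L = O = 0.0
    for i in range(1, len(U)):
        L += min(math.dist(U[i], U[j]) for j in range(i)) ** alpha
        O += min(math.dist(V[i], V[j]) for j in range(i)) ** alpha
    return L, O

rng = random.Random(0)
d, beta, t = 3, 200, 0.01
pts = [(rng.random(), rng.random(), t * rng.random()) for _ in range(beta)]

L_half, O_half = mdst_and_ong_weights(pts, 0.5)
L_one, O_one = mdst_and_ong_weights(pts, 1.0)
L_two, O_two = mdst_and_ong_weights(pts, 2.0)
# Constant from the Mean Value Theorem step with alpha = 2.
C_two = 2.0 * (math.sqrt(d - 1) + 1.0)
```

The gaps `L - O` are at most `(beta - 1)` times the per-edge bound, since there are `beta - 1` edges in each graph.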



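Lemma 5.2 Suppose $d \in \{2, 3, \ldots\}$ and $\alpha \ge d/2$. Let $\mathcal{W}_n$ be as defined at (29) and suppose
that $Q(d-1, \alpha)$ is the mean-zero random variable given in Lemma 3.2. Then as $n \to \infty$

\[ \tilde{\mathcal{L}}^{d,\alpha}(\mathcal{W}_n) \stackrel{d}{\longrightarrow} Q(d-1, \alpha). \qquad (39) \]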

In particular, Q{l,a) = Ga for a>l, where Ga has the distribution given by ^ or 

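Proof. For $\alpha \ge d/2 \ge 1$, it follows from (31) that as $n \to \infty$

\[ \tilde{\mathcal{L}}^{d,\alpha}(\mathcal{W}_n) - \tilde{\mathcal{O}}^{d-1,\alpha}(\mathcal{V}_n) \longrightarrow 0, \quad \text{in } L^1. \qquad (40) \]

Also, since $\mathcal{V}_n$ is a homogeneous Poisson point process of intensity $nt_n = \Theta(n^{(1/2)-\varepsilon})$ on
$(0,1)^{d-1}$, and $\alpha \ge d/2 > (d-1)/2$, we have from Lemma 3.2 that as $n \to \infty$

\[ \tilde{\mathcal{O}}^{d-1,\alpha}(\mathcal{V}_n) \stackrel{d}{\longrightarrow} Q(d-1, \alpha). \qquad (41) \]

Thus (40), (41) and Slutsky's theorem complete the proof of (39). □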



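Proof of Theorem 5.1. For $\alpha \ge d/2$, (27) follows from (39). Now suppose $\alpha \in (0, d/2)$.
First suppose that $\alpha < 1$. Then (32) implies that for $\alpha \in (0,1)$

\[ E\left[ n^{(\alpha/d)-(1/2)} |\mathcal{L}^{d,\alpha}(\mathcal{P}_n; B_n) - \mathcal{O}^{d-1,\alpha}(\mathcal{V}_n)| \right] = O(n^{\alpha((1/d)-(1/2))-\varepsilon(1+\alpha)}) \to 0, \]

since $d \ge 2$ and $\varepsilon > 0$. So for $\alpha \in (0,1)$ we have

\[ n^{(\alpha/d)-(1/2)} \left( \tilde{\mathcal{L}}^{d,\alpha}(\mathcal{P}_n; B_n) - \tilde{\mathcal{O}}^{d-1,\alpha}(\mathcal{V}_n) \right) \longrightarrow 0, \quad \text{in } L^1, \qquad (42) \]

as $n \to \infty$. Also, (31) implies that (42) also holds for $\alpha \in [1, d/2)$ when $d \ge 3$. Thus (42)
holds for all $\alpha \in (0, d/2)$. Recall that $\mathcal{V}_n$ is a homogeneous Poisson point process in $(0,1)^{d-1}$
with intensity $nt_n = \Theta(n^{(1/2)-\varepsilon})$. If $\alpha \le (d-1)/2$, then by Lemma 3.1 (i) and (ii) we have
that for some $C \in (0, \infty)$

\[ \mathrm{Var}\left[ n^{(\alpha/d)-(1/2)} \tilde{\mathcal{O}}^{d-1,\alpha}(\mathcal{V}_n) \right] \le Cn^{(2\alpha/d)-1} (n^{(1/2)-\varepsilon})^{1-(2\alpha/(d-1))} \log n \]
\[ \le Cn^{\alpha((2/d)-(1/(d-1)))-(1/2)} \log n \le Cn^{-1/d} \log n \to 0, \]

as $n \to \infty$. If $\alpha \in ((d-1)/2, d/2)$, then by Lemma 3.2, as $n \to \infty$,

\[ n^{(\alpha/d)-(1/2)} \tilde{\mathcal{O}}^{d-1,\alpha}(\mathcal{V}_n) \stackrel{P}{\longrightarrow} 0. \]

So by Slutsky's theorem with (42) we obtain (28). □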
6 Proof of Theorem 2.3

In this section we are interested in the longest edge in the MDST under $\preccurlyeq_*$ on $\mathcal{P}_n \subset (0,1)^d$.
The intuition behind Theorem 2.3 is that this edge is likely to be near the lower $(d-1)$-dimensional
boundary. Thus we again make use of the fact that the MDST near the boundary
is well-approximated by the appropriate on-line nearest-neighbour graph. Then we deduce
Theorem 2.3 from Theorem 3.1 using the set-up of Section 5.

From Section 5, recall that for fixed $\varepsilon > 0$, $B_n$ denotes the boundary region
$(0,1)^{d-1} \times (0, t_n]$ (where $t_n = \Theta(n^{-(1/2)-\varepsilon})$), and from (29) that $\mathcal{W}_n = \mathcal{P}_n \cap B_n$.
Also, recall from (30) that $\mathcal{V}_n$ is the sequence of $(d-1)$-dimensional projections of $\mathcal{W}_n$ in
order of increasing $x_d$-coordinate.



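Proof of Theorem 2.3. We have from (33) that every edge in the ONG on $\mathcal{V}_n$ has length
bounded above by the length of some edge in the MDST on $\mathcal{W}_n$. On the other hand, we
have from (37) that an edge from $U_i \in \mathcal{W}_n$ in the MDST has length at most $O(t_n)$ greater
than the edge in the ONG from the corresponding $V_i \in \mathcal{V}_n$. Thus we have that for some
$C \in (0, \infty)$ and all $n \ge 1$

\[ 0 \le \mathcal{L}^d_{\max}(\mathcal{W}_n) - \mathcal{O}^{d-1}_{\max}(\mathcal{V}_n) \le Cn^{-(1/2)-\varepsilon}. \]

Hence almost surely

\[ \mathcal{L}^d_{\max}(\mathcal{W}_n) - \mathcal{O}^{d-1}_{\max}(\mathcal{V}_n) \longrightarrow 0, \]

as $n \to \infty$. By Theorem 3.1 (i) and the fact that $\mathcal{V}_n$ is a homogeneous Poisson point process
of intensity $nt_n \to \infty$ (for $\varepsilon$ small), we have

\[ \mathcal{O}^{d-1}_{\max}(\mathcal{V}_n) \stackrel{d}{\longrightarrow} Q_{\max}(d-1), \]

as $n \to \infty$. Hence Slutsky's theorem implies that

\[ \mathcal{L}^d_{\max}(\mathcal{W}_n) \stackrel{d}{\longrightarrow} Q_{\max}(d-1), \qquad (43) \]

as $n \to \infty$. Set

\[ M_n := \max_{x \in \mathcal{P}_n \cap ((0,1)^{d-1} \times (t_n, 1))} d_*(x; \mathcal{P}_n), \]

the length of the longest edge in the MDST from points of $\mathcal{P}_n$ in the region $(0,1)^{d-1} \times (t_n, 1)$.
Then for any $n \ge 1$, $\mathcal{L}^d_{\max}(\mathcal{P}_n) = \max\{\mathcal{L}^d_{\max}(\mathcal{W}_n), M_n\}$; thus

\[ \mathcal{L}^d_{\max}(\mathcal{W}_n) \le \mathcal{L}^d_{\max}(\mathcal{P}_n) \le \mathcal{L}^d_{\max}(\mathcal{W}_n) + M_n. \qquad (44) \]

Hence by (44), (43), and Slutsky's theorem, to complete the proof of the theorem it suffices
to show that as $n \to \infty$

\[ M_n \stackrel{P}{\longrightarrow} 0. \qquad (45) \]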
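We prove this last statement. For $\varepsilon > 0$ as before and $(i_1, \ldots, i_d) \in \mathbb{N}^d$, define the cuboid

\[ C(i_1, \ldots, i_d) := ((i_1 - 1)\lfloor n^\varepsilon \rfloor^{-1}, i_1 \lfloor n^\varepsilon \rfloor^{-1}] \times \cdots \times ((i_{d-1} - 1)\lfloor n^\varepsilon \rfloor^{-1}, i_{d-1} \lfloor n^\varepsilon \rfloor^{-1}] \times ((i_d - 1)\lfloor t_n^{-1} \rfloor^{-1}, i_d \lfloor t_n^{-1} \rfloor^{-1}]. \]

Let $E_n$ denote the event

\[ \bigcup_{(i_1, \ldots, i_d) \in \mathbb{N}^d \cap [(0, \lfloor n^\varepsilon \rfloor]^{d-1} \times (0, \lfloor t_n^{-1} \rfloor]]} \{\mathcal{P}_n \cap C(i_1, \ldots, i_d) = \emptyset\}. \]

The number of points of $\mathcal{P}_n$ in each cuboid $C(i_1, \ldots, i_d)$ in the union is Poisson distributed
with mean

\[ n \lfloor n^\varepsilon \rfloor^{-(d-1)} \lfloor t_n^{-1} \rfloor^{-1} = \Theta(n^{(1/2)-d\varepsilon}), \]

and the total number of cuboids in the union is $\lfloor n^\varepsilon \rfloor^{d-1} \lfloor t_n^{-1} \rfloor = \Theta(n^{(1/2)+d\varepsilon})$. Thus Boole's
inequality implies that there exist $C, C' \in (0, \infty)$ for which, for all $n \ge 1$,

\[ P(E_n) \le Cn^{(1/2)+d\varepsilon} \exp(-C'n^{(1/2)-d\varepsilon}), \]

and hence $P(E_n) \to 0$ as $n \to \infty$, for $\varepsilon$ small enough. However, if $E_n$ does not occur then
each cuboid contains at least one point of $\mathcal{P}_n$ and $M_n$ is bounded by a constant times $n^{-\varepsilon}$.
Thus (45) follows and the proof is complete. □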



7 Proof of Theorem 2.2

In this section we complete the proof of our convergence in distribution result for $\tilde{\mathcal{L}}^{d,\alpha}(\mathcal{P}_n)$,
Theorem 2.2. Recall from Section 4 that $\varepsilon > 0$ is fixed (small) and $\Gamma_n$ denotes the region
$(0,1)^{d-1} \times (g_n, 1)$, where $g_n = \Theta(n^{\varepsilon-(1/d)})$ as $n \to \infty$. As in Section 5, denote by $B_n$ the
region $(0,1)^{d-1} \times (0, t_n]$, where $t_n = \Theta(n^{-(1/2)-\varepsilon})$. We will make a particular choice for $g_n$
and $t_n$ shortly. Denote by $I_n$ the intermediate region $(0,1)^d \setminus (B_n \cup \Gamma_n) = (0,1)^{d-1} \times (t_n, g_n]$.



In order to prove Theorem 2.2, we need to collect previous results on the limiting behaviour
of the MDST in the regions $\Gamma_n$ and $B_n$, and also deal with the region $I_n$. In Sections
4.2 and 5 we saw that, for large $n$, the weight (suitably centred and scaled) of edges starting
in $\Gamma_n$ satisfies a central limit theorem, and the weight of edges starting in $B_n$ can be
approximated by the on-line nearest-neighbour graph. To complete the proof of Theorem 2.2
we shall show that (with a suitable scaling factor for $\alpha < d/2$) the contribution to the total
weight from points in $I_n$ has variance converging to zero, and that the lengths from $B_n$ and
$\Gamma_n$ are asymptotically independent by virtue of the fact that the configuration of points in
$I_n$ is (with probability approaching one) sufficient to ensure that the configuration of points
in $B_n$ has no effect on the edges from points in $\Gamma_n$.



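Recall from (16) that for a point set $\mathcal{X} \subset \mathbb{R}^d$ and a region $R \subset \mathbb{R}^d$, $\mathcal{L}^{d,\alpha}(\mathcal{X}; R)$ denotes
the total weight of edges of the MDST on $\mathcal{X}$ which originate in the region $R$. The next result
is the main result of this section: it gives asymptotic control of the variance of $\mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n)$,
and will allow us to complete the proof of Theorem 2.2.

Lemma 7.1 Suppose $d \in \{2, 3, 4, \ldots\}$ and $\alpha > 0$. Then for small enough $\varepsilon > 0$ there exist
$g_n = \Theta(n^{\varepsilon-(1/d)})$ and $t_n = \Theta(n^{-(1/2)-\varepsilon})$ specifying $I_n$ for which, as $n \to \infty$,

\[ \mathrm{Var}[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n)] \to 0, \quad (\alpha > (d-1)/2), \qquad (46) \]
\[ \text{and} \quad \mathrm{Var}[n^{(\alpha/d)-(1/2)} \mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n)] \to 0, \quad (0 < \alpha < d/2). \qquad (47) \]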
Before embarking on the proof of Lemma 7.1, we prove the following preliminary result
which, for our purposes, will control the dependency structure of the MDST. Let $\mathcal{X}$ be a set
of points in $(0,1)^d$. For non-empty $\mathcal{X}$ and $x \in \mathcal{X}$, let $D_*(x; \mathcal{X})$ denote the total degree of $x$
(i.e. the total number of directed edges that have $x$ as one endpoint) in the MDST on $\mathcal{X}$;
set $D_*(x; \emptyset) := 0$ for any $x$.

Lemma 7.2 Let $d \ge 2$. For any $\varepsilon \in (0,1)$ there exist $C, C' \in (0, \infty)$ such that for all $n \ge 1$

\[ P\Big( \sup_{x\in\mathcal{P}_n} D_*(x; \mathcal{P}_n) > n^\varepsilon \Big) \le C\exp(-C'n^\varepsilon). \]

Proof. Suppose $d \ge 2$. Fix $n \in \mathbb{N}$. Let $(U_1, \ldots, U_n)$ be the points of a binomial point
process of $n$ independent uniform random vectors on $(0,1)^d$, listed in order of increasing
$x_d$-coordinate, so that $U_1 \preccurlyeq_* U_2 \preccurlyeq_* \cdots \preccurlyeq_* U_n$. Set $\mathcal{X}_n := \{U_1, \ldots, U_n\}$.

We now consider our usual coupling of the MDST to the ONG. In coordinates, write
$U_i = (U_i^1, \ldots, U_i^d)$. Set $V_i = (U_i^1, \ldots, U_i^{d-1})$, the projection of $U_i$ down (in the $e_d$-sense)
onto $(0,1)^{d-1}$.

With probability one, the $U_i$, $V_i$ have distinct $d$-, $(d-1)$-dimensional inter-point distances,
so there are no ties to break in constructing the MDST or ONG. Consider a point
$U_j$ with $j \in \{1, \ldots, n-1\}$. Suppose that $U_k$, $j < k \le n$, is joined to $U_j$ in the MDST on
$\mathcal{X}_n$. Then $\|U_k - U_j\|_d < \|U_k - U_i\|_d$ for $i \in \{j+1, \ldots, k-1\}$. Also

\[ \|V_k - V_i\|_{d-1}^2 = \|U_k - U_i\|_d^2 - (U_k^d - U_i^d)^2. \]

Then since $U_i^d$ is increasing in $i$, $\|V_k - V_i\|_{d-1}$ is minimized over $i \in \{j, \ldots, k-1\}$ by $i = j$.
In other words, a necessary condition for $U_k$, $j < k \le n$, to be joined to $U_j$ in the MDST on
$\mathcal{X}_n$ is that the corresponding edge from $V_k$ to $V_j$ exists in the ONG on the sequence of points
$(V_j, V_{j+1}, \ldots, V_n)$ in $(0,1)^{d-1}$.

Hence the in-degree of $U_j$ in the MDST on $\mathcal{X}_n$ is bounded above by the in-degree of
$V_j$ in the ONG on $(V_j, V_{j+1}, \ldots, V_n)$. Since $V_1, \ldots, V_n$ are independent uniform random
vectors in $(0,1)^{d-1}$, we have that this latter quantity has the same distribution as the degree
of $V_1$ in the ONG on $(V_1, V_2, \ldots, V_{n-j+1})$. Hence $D_*(U_j; \mathcal{X}_n)$ is stochastically dominated
by the degree of $V_1$ in the ONG on $(V_1, V_2, \ldots, V_n)$, which we denote $D_{\mathrm{ONG}}(n)$.

Hence

\[ \sup_{1\le j\le n} P(D_*(U_j; \mathcal{X}_n) > s) \le P(D_{\mathrm{ONG}}(n) > s). \]

Then by Boole's inequality, we have that

\[ P\Big( \sup_{1\le j\le n} D_*(U_j; \mathcal{X}_n) > s \Big) \le n \sup_{1\le i\le n} P(D_*(U_i; \mathcal{X}_n) > s) \le nP(D_{\mathrm{ONG}}(n) > s). \]

Let $N(n) = \mathrm{card}(\mathcal{P}_n)$. We have

\[ P\Big( \sup_{1\le j\le N(n)} D_*(U_j; \mathcal{X}_{N(n)}) > s \Big) \le P(N(n) > 2n) + \sup_{m\le 2n} P\Big( \sup_{1\le j\le m} D_*(U_j; \mathcal{X}_m) > s \Big) \]
\[ \le P(N(n) > 2n) + 2nP(D_{\mathrm{ONG}}(2n) > s). \]

By following the argument in Section 3.1 of [6] we have that for any $\varepsilon > 0$,
$P(D_{\mathrm{ONG}}(2n) > n^\varepsilon) = O(\exp(-Cn^\varepsilon))$. Also, $P(N(n) > 2n) = O(\exp(-Cn))$ by standard
Poisson tail bounds (e.g. Lemma 1.2 in [17]). This completes the proof. □
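
The pathwise in-degree comparison at the heart of this proof can be checked directly on a sample. The following sketch is illustrative only: it uses a small binomial sample in the unit cube, joins each point to its nearest predecessor in the ordering by last coordinate, and the helper names are hypothetical.

```python
import math
import random

def mdst_in_degrees(U):
    """U is sorted by last coordinate; U[k] joins its nearest U[j] with j < k."""
    deg = [0] * len(U)
    for k in range(1, len(U)):
        j = min(range(k), key=lambda i: math.dist(U[k], U[i]))
        deg[j] += 1
    return deg

def ong_in_degree_of_first(V):
    """In-degree of V[0] in the ONG built on the sequence V[0], V[1], ..."""
    return sum(1 for k in range(1, len(V))
               if min(range(k), key=lambda i: math.dist(V[k], V[i])) == 0)

rng = random.Random(7)
n, d = 60, 3
U = sorted((tuple(rng.random() for _ in range(d)) for _ in range(n)),
           key=lambda p: p[-1])
V = [p[:-1] for p in U]                        # projections onto the base
mdst_deg = mdst_in_degrees(U)
# For each j, the ONG is restarted on the tail sequence (V_j, V_{j+1}, ..., V_n).
ong_deg = [ong_in_degree_of_first(V[j:]) for j in range(n)]
```

Each entry of `mdst_deg` is bounded by the corresponding entry of `ong_deg`, matching the necessary condition derived above.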



To prove Lemma 7.1 we first derive an upper bound ((52) below) for $\mathrm{Var}[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n)]$
in terms of the mean-square changes in $\mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n)$ on re-sampling Poisson points over a
certain partition of $B_n \cup I_n$ into boxes, in a similar way to a technique in [26]. Unlike in [26],
where the boxes are the same shape and size, we need to use boxes of different shapes to
take account of the structure of the MDST near the boundary.

For each $n \ge 1$, we will divide $(0,1)^d$ into layers of rectangular $d$-cells. To begin we will
divide $(0,1)^{d-1} \times (0, \infty)$ into layers starting at the base (in the $e_d$ sense). The $k$-th layer
($k \in \mathbb{N}$) will have height $h_n(k)$ given by

\[ h_n(k) := n^{-1+\varepsilon} 2^{(k-1)(d-1)}. \]

We will let $H_n(k)$ denote the starting height (in the $e_d$ sense) of layer $k$; define $H_n(1) := 0$
and for $k \ge 2$

\[ H_n(k) := \sum_{i=1}^{k-1} h_n(i) = n^{-1+\varepsilon} \sum_{i=0}^{k-2} 2^{i(d-1)} = q n^{-1+\varepsilon} (2^{(k-1)(d-1)} - 1), \]

where $q = (2^{d-1} - 1)^{-1}$ depends only on $d$. We then define the box

\[ L_n(k) := (0,1)^{d-1} \times (H_n(k), H_n(k+1)] \quad (k \in \mathbb{N}); \]

we will refer to $L_n(k)$ as the $k$-th layer.
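For $n \ge 1$ define $M_n \in \mathbb{N}$ such that

\[ M_n := \min\{m \in \mathbb{N} : H_n(m+1) \ge n^{-(1/2)-\varepsilon}\}. \]

Then $M_n$ satisfies

\[ M_n = \Theta(\log n), \quad 2^{M_n} = \Theta(n^{(1-4\varepsilon)/(2(d-1))}). \qquad (48) \]

We then define for $n \ge 1$ the region

\[ B_n := \bigcup_{k=1}^{M_n} L_n(k) = (0,1)^{d-1} \times (0, H_n(M_n+1)]. \]

Then with our previous notation as $B_n = (0,1)^{d-1} \times (0, t_n]$, we have $t_n = H_n(M_n+1) =
\Theta(n^{-(1/2)-\varepsilon})$.

Also for $n \ge 1$ define $K_n \in \mathbb{N}$ such that

\[ K_n := \min\{k \in \mathbb{N} : H_n(k+1) \ge n^{\varepsilon-(1/d)}\}. \]

Thus

\[ K_n = \Theta(\log n), \quad 2^{K_n} = \Theta(n^{1/d}). \qquad (49) \]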
Define for $n \ge 1$ the region

\[ I_n := \bigcup_{k=M_n+1}^{K_n} L_n(k) = (0,1)^{d-1} \times (H_n(M_n+1), H_n(K_n+1)], \qquad (50) \]

so that, with our previous notation for $I_n$, $g_n = H_n(K_n+1) = \Theta(n^{\varepsilon-(1/d)})$. These specific
choices for $t_n$ and $g_n$ then fit with our previous usage.

We now subdivide each layer into cells. For $k = 1, 2, \ldots, K_n$, divide layer $k$ into rectangular
cells of height $h_n(k)$ by forming a grid by dividing each of the $d-1$ sides of the layer
into $2^{k-1}$ equal intervals. Thus layer $k$ then consists of $2^{(k-1)(d-1)}$ cells of height $h_n(k)$ and
$(d-1)$-widths $2^{1-k}$. Each such cell has volume $2^{(1-k)(d-1)}h_n(k) = n^{-1+\varepsilon}$. The total number
of cells in all the layers up to layer $K_n$ we denote by $\ell(n)$, which is given by

\[ \ell(n) := \sum_{k=1}^{K_n} 2^{(k-1)(d-1)} = \Theta(2^{(d-1)K_n}) = \Theta(n^{1-(1/d)}), \qquad (51) \]

by (49). For layers 1 up to $K_n$, label the individual cells lexicographically as $S_i^n$, $i =
1, 2, \ldots, \ell(n)$.

Note that for $\varepsilon$ small enough, cells in layer $k \le M_n$ are always wider than they are tall,
while for $M_n < k \le K_n$ cells in layer $k$ have height at most a constant times $n^\varepsilon$ times their
width.
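
To make the layer and cell bookkeeping concrete, here is a small Python sketch (an illustration with hypothetical function names, not part of the original construction). It assumes the reading $h_n(k) = n^{-1+\varepsilon}2^{(k-1)(d-1)}$ with $2^{(k-1)(d-1)}$ cells in layer $k$, and compares the closed-form layer index $\lceil (d-1)^{-1}\log_2[(2^{d-1}-1)i+1] \rceil$ of cell $i$ with a direct count.

```python
import math

def layer_structure(n, d, eps):
    """Heights h(k), starting heights H(k), the indices M and K, and cell counts."""
    q = 1.0 / (2 ** (d - 1) - 1)

    def h(k):
        return n ** (-1 + eps) * 2 ** ((k - 1) * (d - 1))

    def H(k):
        return q * n ** (-1 + eps) * (2 ** ((k - 1) * (d - 1)) - 1)

    def first_layer_reaching(level):
        k = 1
        while H(k + 1) < level:
            k += 1
        return k

    M = first_layer_reaching(n ** (-0.5 - eps))
    K = first_layer_reaching(n ** (eps - 1.0 / d))
    cells = [2 ** ((k - 1) * (d - 1)) for k in range(1, K + 1)]
    return h, H, M, K, cells

def layer_of_cell(i, d):
    """Closed-form layer index of the i-th cell in lexicographic labelling."""
    return math.ceil(math.log2((2 ** (d - 1) - 1) * i + 1) / (d - 1))

n, d, eps = 10 ** 6, 3, 0.05
h, H, M, K, cells = layer_structure(n, d, eps)
total_cells = sum(cells)
# Direct labelling: layer k contributes cells[k-1] consecutive indices.
direct = [k for k in range(1, K + 1) for _ in range(cells[k - 1])]
```

With these sample parameters every cell has volume $n^{-1+\varepsilon}$, so the expected number of Poisson points per cell is $n^\varepsilon$ in every layer.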

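Let $\mathcal{P}'_n$ denote an independent copy of the homogeneous Poisson point process $\mathcal{P}_n$, and
for $i = 1, 2, \ldots, \ell(n)$ set

\[ \mathcal{P}_n^i := (\mathcal{P}_n \setminus S_i^n) \cup (\mathcal{P}'_n \cap S_i^n), \]

so that $\mathcal{P}_n^i$ is $\mathcal{P}_n$ but with the Poisson points in $S_i^n$ independently re-sampled. For ease of
notation during this proof, for $n > 0$ set $Y_n = \tilde{\mathcal{L}}^{d,\alpha}(\mathcal{P}_n; I_n)$. Define

\[ \Delta_i^n := \tilde{\mathcal{L}}^{d,\alpha}(\mathcal{P}_n^i; I_n) - \tilde{\mathcal{L}}^{d,\alpha}(\mathcal{P}_n; I_n) = \mathcal{L}^{d,\alpha}(\mathcal{P}_n^i; I_n) - \mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n), \]

the change in $Y_n$ on re-sampling the Poisson points in $S_i^n$. By Steele's [30] version of the
Efron-Stein inequality, or by a martingale difference argument, we have that for $n > 0$

\[ \mathrm{Var}[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n)] = E[Y_n^2] \le \sum_{i=1}^{\ell(n)} E[(\Delta_i^n)^2]. \qquad (52) \]

For $i = 1, 2, \ldots, \ell(n)$, let $G(i)$ be the integer $k \in \{1, \ldots, K_n\}$ such that $S_i^n \subset L_n(k)$, so that
$G(i)$ is the layer to which $S_i^n$ belongs. Formally,

\[ G(i) := \lceil (d-1)^{-1} \log_2[(2^{d-1} - 1)i + 1] \rceil. \]

The next result gives bounds on $E[(\Delta_i^n)^2]$.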

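Lemma 7.3 Let $d \in \{2, 3, \ldots\}$ and $\alpha > 0$. There exists $C \in (0, \infty)$ such that for all $n > 0$
and all $i \in \{1, 2, \ldots, \ell(n)\}$

\[ E[(\Delta_i^n)^2] \le \begin{cases} Cn^{(6+4\alpha)\varepsilon} n^{-\alpha/(d-1)} & \text{if } G(i) \le M_n, \\ Cn^{(6+2\alpha)\varepsilon} 2^{-2\alpha G(i)} & \text{if } M_n < G(i) \le K_n. \end{cases} \qquad (53) \]

Note that $2^{-2\alpha G(i)} = \Theta(i^{-2\alpha/(d-1)})$ as $i \to \infty$, and for $G(i) \le M_n$ or $G(i) \le K_n$,
$i = O(n^{(1-4\varepsilon)/2})$ or $i = O(n^{1-(1/d)})$ respectively.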



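Proof of Lemma 7.3. Let $E'_n$ denote the event that every cell $S_j^n \subset (B_n \cup I_n)$ contains at
least one and not more than $n^{2\varepsilon}$ points of $\mathcal{P}_n$, and also $\mathcal{P}'_n$. That is,

\[ E'_n := \bigcap_{1\le j\le \ell(n)} \{1 \le \mathrm{card}(\mathcal{P}_n \cap S_j^n) \le n^{2\varepsilon},\ 1 \le \mathrm{card}(\mathcal{P}'_n \cap S_j^n) \le n^{2\varepsilon}\}. \]

We have from Boole's inequality and the fact that $\mathrm{card}(\mathcal{P}'_n \cap S_j^n)$ has the same distribution
as $\mathrm{card}(\mathcal{P}_n \cap S_j^n)$

\[ P((E'_n)^c) \le 2 \sum_{1\le j\le \ell(n)} P(\{1 \le \mathrm{card}(\mathcal{P}_n \cap S_j^n) \le n^{2\varepsilon}\}^c) \]
\[ = 2\ell(n) [P(\mathrm{card}(\mathcal{P}_n \cap S_1^n) > n^{2\varepsilon}) + P(\mathrm{card}(\mathcal{P}_n \cap S_1^n) = 0)]. \qquad (54) \]

Now $\mathrm{card}(\mathcal{P}_n \cap S_j^n)$, $j = 1, \ldots, \ell(n)$, are Poisson distributed with mean $n^\varepsilon$ (since $|S_j^n| = n^{-1+\varepsilon}$).
By standard Chernoff bounds on Poisson tails (see e.g. Lemma 1.2 of [17]), we have that
$P(\mathrm{card}(\mathcal{P}_n \cap S_j^n) > n^{2\varepsilon}) = O(\exp(-Cn^{2\varepsilon} \log n))$, whereas $P(\mathrm{card}(\mathcal{P}_n \cap S_j^n) = 0) = \exp(-n^\varepsilon)$.
Thus from (54), using (51), there exists $C \in (0, \infty)$ such that

\[ P((E'_n)^c) = O(n^{1-(1/d)} \exp(-n^\varepsilon)) = O(\exp(-Cn^\varepsilon)), \qquad (55) \]

as $n \to \infty$.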

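Also, for $\varepsilon > 0$ and $n > 0$ let $E''_n$ denote the event that the maximum vertex degree in
the MDST on $\mathcal{P}_n$ and on $\mathcal{P}_n^i$ for each $i$ is bounded by $n^\varepsilon$; i.e.

\[ E''_n := \Big\{ \sup_{\mathcal{X} \in \{\mathcal{P}_n, \mathcal{P}_n^1, \ldots, \mathcal{P}_n^{\ell(n)}\}} \sup_{x\in\mathcal{X}} D_*(x; \mathcal{X}) \le n^\varepsilon \Big\}. \]

Then by Lemma 7.2 we have that for some $C \in (0, \infty)$,

\[ P((E''_n)^c) = O(\exp(-Cn^\varepsilon)). \qquad (56) \]

Let

\[ E_n := E'_n \cap E''_n. \qquad (57) \]

Then $P(E_n^c) \le P((E'_n)^c) + P((E''_n)^c)$ so that by (55) and (56) we have that there exists
$C \in (0, \infty)$ such that as $n \to \infty$

\[ P(E_n^c) = O(\exp(-Cn^\varepsilon)). \qquad (58) \]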



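We bound $E[(\Delta_i^n)^2]$ by partitioning over the occurrence of $E_n$ and using the fact that

\[ E[(\Delta_i^n)^2] \le E[(\Delta_i^n)^2 \mid E_n] + E[(\Delta_i^n)^2 \mathbf{1}_{E_n^c}]. \qquad (59) \]

First note that by the Cauchy-Schwarz inequality and the trivial bound $|\Delta_i^n| \le C(\mathrm{card}(\mathcal{P}_n) +
\mathrm{card}(\mathcal{P}'_n))$, we have that

\[ E[(\Delta_i^n)^2 \mathbf{1}_{E_n^c}] \le C (E[(N(n) + N'(n))^4])^{1/2} (P(E_n^c))^{1/2}, \]

where $N(n)$, $N'(n)$ are independent Poisson random variables with mean $n$. Hence from (58)
we have that for some $C \in (0, \infty)$

\[ E[(\Delta_i^n)^2 \mathbf{1}_{E_n^c}] = O(\exp(-Cn^\varepsilon)). \qquad (60) \]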



Next we treat the case where $E_n$ occurs. First suppose $G(i) \le M_n$, so that $S_i^n \subset B_n$.
Contributions to $\Delta_i^n$ are from directed edges from Poisson points in $I_n$ to Poisson points
in $S_i^n$, specifically, such edges that are added or deleted on the re-sampling of the Poisson
points in $S_i^n$. The number of such edges is bounded by the sums of the vertex degrees in
the MDST of points of $\mathcal{P}_n \cap S_i^n$ and $\mathcal{P}'_n \cap S_i^n$. Given $E_n$, the number of points of $\mathcal{P}_n \cap S_i^n$ is
bounded by $n^{2\varepsilon}$, similarly with $\mathcal{P}'_n$, and each point has degree bounded by $n^\varepsilon$. It follows that
the number of edges that can contribute to $\Delta_i^n$ is bounded by $2n^{3\varepsilon}$ under $E_n$. Further, given
$E_n$, the length of an edge contributing to $\Delta_i^n$ is bounded by a constant times the width of
cells in $L_n(M_n+1)$, the first layer in $I_n$, which for $d \ge 2$ is $O(2^{-M_n}) = O(n^{2\varepsilon-(1/(2(d-1)))})$ by
(48). Each edge therefore gives a contribution to $\Delta_i^n$ of at most $O(n^{2\alpha\varepsilon-(\alpha/(2(d-1)))})$ in absolute
value. It follows that there exists $C \in (0, \infty)$ such that for all $n > 0$ and all $i$ with $G(i) \le M_n$

\[ E[(\Delta_i^n)^2 \mid E_n] \le Cn^{(6+4\alpha)\varepsilon} n^{-\alpha/(d-1)}. \qquad (61) \]

Thus from (59) with the bounds (60) and (61) we obtain the $G(i) \le M_n$ case of (53).



Finally suppose $M_n < G(i) \le K_n$, so that $S_i^n \subset I_n$. Given $E_n$, the number of points of
$\mathcal{P}_n \cap S_i^n$ is bounded by $n^{2\varepsilon}$; similarly for $\mathcal{P}'_n$. Further, given $E_n$, edge lengths contributing
to $\Delta_i^n$ are bounded by a constant times $n^\varepsilon$ times the width of a cell in layer $G(i)$, which is
$O(2^{-G(i)})$, and each point has degree bounded by $n^\varepsilon$. Thus for $M_n < G(i) \le K_n$,

\[ E[(\Delta_i^n)^2 \mid E_n] = O(n^{(6+2\alpha)\varepsilon} \cdot 2^{-2\alpha G(i)}). \qquad (62) \]

Then (59) with (60) and (62) yields the $M_n < G(i) \le K_n$ case of (53). □



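We can now complete the proof of Lemma 7.1.

Proof of Lemma 7.1. Fix $d \ge 2$ and $\alpha > 0$. Take $I_n$ as defined by (50) so that $g_n =
H_n(K_n+1)$ and $t_n = H_n(M_n+1)$ are as in the statement of Lemma 7.1. Again using the
shorthand $Y_n = \tilde{\mathcal{L}}^{d,\alpha}(\mathcal{P}_n; I_n)$, we obtain from (52) with (53) that for all $n > 0$

\[ E[Y_n^2] \le \sum_{1\le i\le \ell(n)} E[(\Delta_i^n)^2] = \sum_{k=1}^{M_n} \sum_{i: S_i^n \subset L_n(k)} E[(\Delta_i^n)^2] + \sum_{k=M_n+1}^{K_n} \sum_{i: S_i^n \subset L_n(k)} E[(\Delta_i^n)^2] \]
\[ \le Cn^{(6+4\alpha)\varepsilon-(\alpha/(d-1))} \sum_{k=1}^{M_n} 2^{(k-1)(d-1)} + Cn^{(6+2\alpha)\varepsilon} \sum_{k=M_n+1}^{K_n} 2^{(k-1)(d-1)-2\alpha k} \]
\[ \le Cn^{(6+4\alpha)\varepsilon-(\alpha/(d-1))} 2^{(d-1)M_n} + Cn^{(7+2\alpha)\varepsilon} \left( 2^{(d-1-2\alpha)M_n} + 2^{(d-1-2\alpha)K_n} \right), \]

where the additional factor $n^\varepsilon$ in the last two terms takes care of the extra logarithmic
factor when $\alpha = (d-1)/2$. Using (48) and (49) we thus have that for any $\varepsilon > 0$ there exists
$C \in (0, \infty)$ such that for all $n > 0$

\[ E[Y_n^2] \le Cn^{(1/2)-(\alpha/(d-1))+(4+4\alpha)\varepsilon} [1 + n^{(1+2\alpha)\varepsilon}] + Cn^{1-(1/d)-(2\alpha/d)+(7+2\alpha)\varepsilon}. \qquad (63) \]

For $d \ge 2$, this tends to zero as $n \to \infty$ for $\alpha > (d-1)/2$ and $\varepsilon$ sufficiently small, which gives
(46). On the other hand, for $\alpha < d/2$, we have from (63), noting that $(2\alpha/d) - (\alpha/(d-1)) =
(\alpha/d)(d-2)/(d-1)$, that

\[ E[n^{(2\alpha/d)-1} Y_n^2] \le Cn^{(\alpha/d)(d-2)/(d-1)-(1/2)+(4+4\alpha)\varepsilon} [1 + n^{(1+2\alpha)\varepsilon}] + Cn^{-(1/d)+(7+2\alpha)\varepsilon}, \]

which also tends to zero as $n \to \infty$ for $\varepsilon$ small enough and $d \ge 2$. This gives (47). □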
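
The exponent arithmetic in this last step can be checked with exact rational arithmetic. The sketch below is an illustration that assumes the scaled bound has three terms with exponents $(\alpha/d)(d-2)/(d-1) - 1/2 + (4+4\alpha)\varepsilon$, the same plus $(1+2\alpha)\varepsilon$, and $-(1/d) + (7+2\alpha)\varepsilon$; this reading of the garbled display, and the function name, are assumptions made here.

```python
from fractions import Fraction as Fr

def scaled_exponents(d, alpha, eps):
    """Exponents of n in the three terms assumed to bound n^{(2a/d)-1} E[Y_n^2]."""
    base = (alpha / d) * Fr(d - 2, d - 1) - Fr(1, 2)
    return (base + (4 + 4 * alpha) * eps,
            base + (5 + 6 * alpha) * eps,
            -Fr(1, d) + (7 + 2 * alpha) * eps)

eps = Fr(1, 1000)
checks = []
for d in range(2, 7):
    for j in range(1, 2 * d):                  # alpha = j/4 ranges over (0, d/2)
        alpha = Fr(j, 4)
        # Identity quoted in the text: (2a/d) - a/(d-1) = (a/d)(d-2)/(d-1).
        identity = (2 * alpha / d - alpha / (d - 1)
                    == (alpha / d) * Fr(d - 2, d - 1))
        negative = all(e < 0 for e in scaled_exponents(d, alpha, eps))
        checks.append((d, alpha, identity, negative))
```

All three exponents are negative for every sampled pair with $0 < \alpha < d/2$ once $\varepsilon$ is small, which is the property the variance bound needs.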



Proof of Theorem \2.2\ Again we use the construction of Lemma [7. 1[ For the duration of 
this proof, to ease notation, set X„ = £'^'"(P„; r„), F„ = £'^'"(P„; /„) and Z„ = £^'"(P„; 

Thus C'^'^'iVn) = x„ + y; + z„. 



First suppose a G (0, d/2). Then from (28) and (47) we have that 



as $n \to \infty$. With Lemma 4.4 and Slutsky's theorem, we obtain the $\alpha \in (0, d/2)$ case of
Theorem 2.2.



Now suppose $\alpha > d/2$. Then Lemma 4.4 and (46) imply that $X_n + Y_n \to 0$ in probability as $n \to \infty$.
So (27) with Slutsky's theorem gives the $\alpha > d/2$ case of Theorem 2.2.

Finally suppose $\alpha = d/2$. Again (46) implies that $Y_n \to 0$ in probability. Here we have from (27)
that $Z_n \to Q(d-1, d/2)$ in distribution and from Lemma 4.4 that $X_n \to W_1$ in distribution, where $W_1$ is a normal
random variable. We need to show that the limits $W_1$ and $Q(d-1, d/2)$ are independent.
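Set $k_n := \lfloor n^{(1/d)-(\varepsilon/2)} \rfloor$. For $z \in \mathbb{Z}^{d-1} \cap [0, k_n)^{d-1}$ let $C(z) \subseteq I_n$ denote the cube

$$C(z) := (k_n^{-1} z, 0) + (0, k_n^{-1}]^{d-1} \times (g_n - k_n^{-1}, g_n].$$

Thus there are $k_n^{d-1} = \Theta(n^{1-(1/d)-(\varepsilon(d-1)/2)})$ such cubes, each of volume $k_n^{-d} = \Theta(n^{-1+(\varepsilon d/2)})$.
Let $A_n$ denote the event

$$A_n := \bigcap_{z \in \mathbb{Z}^{d-1} \cap [0, k_n)^{d-1}} \{ \mathrm{card}(\mathcal{P}_n \cap C(z)) > 0 \}.$$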

The number of points of $\mathcal{P}_n$ in each cube $C(z)$ is a Poisson random variable with mean
$\Theta(n^{\varepsilon d/2})$, and so

$$P(A_n^c) \leq \sum_z P(\mathrm{card}(\mathcal{P}_n \cap C(z)) = 0) = O\bigl(n^{1-(1/d)-(\varepsilon(d-1)/2)} \cdot \exp(-C n^{\varepsilon d/2})\bigr) \to 0,$$

as $n \to \infty$. Given a configuration of $\mathcal{P}_n$ satisfying $A_n$, for $n$ sufficiently large, $X_n$ and $Z_n$ are
(conditionally) independent, since no point of $\mathcal{P}_n \cap \Gamma_n$ can be joined to a point of $\mathcal{P}_n \cap B_n$
in the MDST. Then the proof is completed by following the argument for Equation (7.25)
in [23]. □
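To get a feel for how quickly the union bound on the probability of an empty cube decays, the following sketch evaluates $k_n^{d-1} \exp(-n k_n^{-d})$ numerically. It assumes the floor in the definition of $k_n$ and takes all implied constants equal to 1; the function name `empty_cube_bound` is hypothetical and is used only for illustration.

```python
import math

def empty_cube_bound(n, d, eps):
    """Union bound k^(d-1) * exp(-n / k^d) on the chance that some cube is empty.

    Assumption: k = floor(n^(1/d - eps/2)) and the points form a homogeneous
    Poisson process of intensity n, so a cube of side 1/k contains a
    Poisson(n / k^d) number of points.
    """
    k = math.floor(n ** (1.0 / d - eps / 2.0))
    if k < 1:
        raise ValueError("n is too small for this choice of d and eps")
    mean_per_cube = n / k ** d      # Poisson mean in a single cube
    num_cubes = k ** (d - 1)        # cubes in the (d-1)-dimensional grid
    return num_cubes * math.exp(-mean_per_cube)

if __name__ == "__main__":
    for n in (10**4, 10**6, 10**8):
        print(n, empty_cube_bound(n, d=2, eps=0.2))
```

The polynomial number of cubes is overwhelmed by the stretched-exponential factor, which is the only feature of the bound that the proof uses.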
8 Proof of Theorem 2.1



In order to complete the proof of Theorem 2.1, we need to add to the law of large numbers
away from the boundary (in region $\Gamma_n$), Lemma 4.1, by dealing with the edges near to the
boundary. We proceed in a similar fashion to Sections 5 and 7, dealing with the contributions
from the region $B_n$ in Lemma 8.2 below (using the coupling to the ONG as in Section 5),
and with the contributions from the region $I_n$ in Lemma 8.1 below (using the construction
of Section 7).

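Lemma 8.1 Suppose $d \in \{2, 3, \ldots\}$ and $\alpha > 0$. Then for small enough $\varepsilon > 0$ there exist
$g_n = \Theta(n^{\varepsilon-(1/d)})$ and $t_n = \Theta(n^{-(1/2)-\varepsilon})$ specifying $I_n$ for which, as $n \to \infty$,

$$n^{(\alpha/d)-1} \mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n) \to 0, \text{ in } L^1 \quad (\alpha \in (0, d)), \qquad (64)$$

and

$$\mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n) \to 0, \text{ in } L^1 \quad (\alpha > d-1). \qquad (65)$$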

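Proof. Recall the construction of the partition of $I_n$ described in Section 7 and the definition
of the event $E_n$ from (57). Then

$$E[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n)] = E[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n) \mathbf{1}_{E_n}] + E[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n) \mathbf{1}_{E_n^c}],$$

where by Cauchy-Schwarz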



(66) 



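where $N(n) = \mathrm{card}(\mathcal{P}_n)$ is Poisson distributed with mean $n$. Thus by (58) there exists
$C \in (0,\infty)$ such that

$$E[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n) \mathbf{1}_{E_n^c}] = O(\exp(-C n^{\varepsilon})). \qquad (67)$$

Also, using the construction of Section 7,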



k=M„+l i:S^CL{k) 



Given $E_n$, as in the proof of Lemma 7.3, the number of points in each cell $S_i^n$ is bounded by $n^{2\varepsilon}$,
the degree of each point is bounded by $n^{\varepsilon}$, and each edge has length bounded by a constant
times $n^{\varepsilon} 2^{-G(i)}$. Thus



K„ 



k = Mn + l 



Thus from (66) with (67) and (68) we obtain



(68) 



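$$E[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n)] = O\bigl(2^{(d-1-\alpha)K_n} n^{(3+\alpha)\varepsilon}\bigr) + O\bigl(2^{(d-1-\alpha)M_n} n^{(4+\alpha)\varepsilon}\bigr),$$

where the additional $n^{\varepsilon}$ factor in the second term takes care of the extra logarithmic factor
when $\alpha = d-1$. Using (48) and (49) we have for $d \geq 2$

$$E[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n)] = O\bigl(n^{1-(\alpha/d)-(1/d)+(3+\alpha)\varepsilon}\bigr) + O\bigl(n^{(1/2)-(\alpha/(2(d-1)))+(2+6\alpha)\varepsilon}\bigr). \qquad (69)$$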
For $\alpha > d-1$ this tends to zero as $n \to \infty$ for $\varepsilon$ small enough, and so we obtain (65). On
the other hand, for $\alpha \in (0, d)$, we have from (69) that



which again tends to zero for $\varepsilon$ small enough, giving (64). □
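The passage from the cell-level bounds to (69), and from (69) to the two limits, is a geometric-sum computation followed by exponent arithmetic. The following is a sketch of that arithmetic under one assumption that is not stated in this section, namely that layer $L(k)$ contains $O(2^{(d-1)k})$ cells; it tracks only the leading powers of $n$ and ignores the multiples of $\varepsilon$.

```latex
% Assumption (not from this section): layer L(k) contains O(2^{(d-1)k}) cells.
% Given E_n, each cell in layer k contributes at most
%   (points) x (degree) x (edge length)^alpha
%   <= n^{2 eps} * n^{eps} * (C n^{eps} 2^{-k})^alpha.
\begin{align*}
\sum_{k=M_n+1}^{K_n} 2^{(d-1)k}\, n^{(3+\alpha)\varepsilon}\, 2^{-\alpha k}
  &= n^{(3+\alpha)\varepsilon} \sum_{k=M_n+1}^{K_n} 2^{(d-1-\alpha)k} \\
  &= \begin{cases}
       O\bigl(2^{(d-1-\alpha)K_n}\, n^{(3+\alpha)\varepsilon}\bigr), & \alpha < d-1, \\
       O\bigl((K_n - M_n)\, n^{(3+\alpha)\varepsilon}\bigr), & \alpha = d-1, \\
       O\bigl(2^{(d-1-\alpha)M_n}\, n^{(3+\alpha)\varepsilon}\bigr), & \alpha > d-1.
     \end{cases}
\end{align*}
% Leading exponents of n in (69) after multiplying by n^{(alpha/d)-1}:
\begin{align*}
\Bigl(\tfrac{\alpha}{d} - 1\Bigr) + \Bigl(1 - \tfrac{\alpha}{d} - \tfrac{1}{d}\Bigr)
  &= -\tfrac{1}{d}, \\
\Bigl(\tfrac{\alpha}{d} - 1\Bigr) + \Bigl(\tfrac12 - \tfrac{\alpha}{2(d-1)}\Bigr)
  &= \tfrac{\alpha}{d}\cdot\tfrac{d-2}{2(d-1)} - \tfrac12
  \;\le\; -\tfrac{1}{2(d-1)} \qquad (0 < \alpha < d).
\end{align*}
```

Both leading exponents are strictly negative for $\alpha \in (0, d)$ and $d \geq 2$, which is why a small enough $\varepsilon$ suffices for (64).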



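Recall the definition of the point process $\mathcal{V}_n \subset (0, 1)^{d-1}$ from (30).

Lemma 8.2 Suppose $d \in \{2, 3, \ldots\}$. For $\alpha \in (0, d)$ we have that as $n \to \infty$

$$n^{(\alpha/d)-1} \mathcal{O}^{d-1,\alpha}(\mathcal{V}_n) \to 0, \text{ in } L^1. \qquad (70)$$

Also, for $\alpha \geq d$, there exist finite positive constants $\mu(d-1, \alpha)$ such that as $n \to \infty$,

$$E[\mathcal{O}^{d-1,\alpha}(\mathcal{V}_n)] \to \mu(d-1, \alpha). \qquad (71)$$

In particular, $\mu(1, \alpha) = \frac{2}{\alpha(\alpha+1)} \bigl(1 + \frac{2^{-\alpha}}{\alpha-1}\bigr)$.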
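For the case $d = 2$ the limit in (71) concerns the on-line nearest-neighbour graph on the unit interval, where each point after the first is joined to its nearest predecessor. The following minimal sketch simulates this one-dimensional graph and compares the Monte Carlo mean with the closed form for $\mu(1, \alpha)$ given above. The function names are hypothetical, and the sketch uses a fixed number of uniform points rather than the Poisson number $\beta_n$, so it illustrates the limiting constant only.

```python
import random

def ong_total_weight(points, alpha):
    """Sum of (distance to nearest predecessor)^alpha over all points after the first."""
    total = 0.0
    for i in range(1, len(points)):
        nearest = min(abs(points[i] - points[j]) for j in range(i))
        total += nearest ** alpha
    return total

def mu_1(alpha):
    """Closed form 2/(alpha(alpha+1)) * (1 + 2^(-alpha)/(alpha-1)), valid for alpha > 1."""
    if alpha <= 1:
        raise ValueError("the closed form requires alpha > 1")
    return 2.0 / (alpha * (alpha + 1)) * (1.0 + 2.0 ** (-alpha) / (alpha - 1))

def monte_carlo_mean(alpha, m, reps, seed=0):
    """Average ONG weight over `reps` independent samples of m uniform points on (0, 1)."""
    rng = random.Random(seed)
    acc = 0.0
    for _ in range(reps):
        pts = [rng.random() for _ in range(m)]
        acc += ong_total_weight(pts, alpha)
    return acc / reps

if __name__ == "__main__":
    print(mu_1(2.0), monte_carlo_mean(2.0, m=100, reps=500))
```

For $\alpha = 2$ the closed form equals $5/12$. The simulated mean sits slightly below it for finite `m`, since edges added after the first `m` points are not counted.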

Proof. Suppose $\alpha \in (0, d)$. Recall that $\beta_n = \mathrm{card}(\mathcal{V}_n)$ is Poisson with mean $\Theta(n^{(1/2)-\varepsilon})$. Let
$U_1, U_2, \ldots$ be a sequence of independent uniform random vectors on $(0, 1)^d$. Let $\mathcal{U}_m$ denote
the sequence of uniform random vectors in $(0, 1)^{d-1}$ formed by the sequence of orthogonal
projections down onto $(0, 1)^{d-1}$ of the points of $\{U_1, \ldots, U_m\} \cap B_n$ listed in order of increasing
$x_d$-coordinate. Then, without loss of generality, we can assume that $\mathcal{P}_n = \{U_1, \ldots, U_{N(n)}\}$
with $N(n)$ Poisson with mean $n$, $\beta_n = \mathrm{card}(\mathcal{P}_n \cap B_n)$, and $\mathcal{V}_n = \mathcal{U}_{\beta_n}$ in this notation.

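Let $A_n$ denote the event $\{\beta_n > n t_n + n^{1/4}\}$. Then by standard Chernoff bounds on
Poisson tails (see, e.g., Lemma 1.2 of [17]), $P(A_n) = O(e^{-C n^{\varepsilon}})$ for some $C \in (0, \infty)$. With
the coupling described above,

$$n^{(\alpha/d)-1} \mathcal{O}^{d-1,\alpha}(\mathcal{V}_n) \leq n^{(\alpha/d)-1} \mathcal{O}^{d-1,\alpha}(\mathcal{U}_{\lceil n t_n + n^{1/4} \rceil}) + n^{(\alpha/d)-1} \mathbf{1}_{A_n} C' N(n), \qquad (72)$$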

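for some $C' \in (0, \infty)$, where $N(n) = \mathrm{card}(\mathcal{P}_n)$ is Poisson with mean $n$. By Theorem 2.1 of [24],
for $\alpha < d-1$ we have that as $m \to \infty$

$$E[\mathcal{O}^{d-1,\alpha}(\mathcal{U}_m)] = O(m^{(d-1-\alpha)/(d-1)}), \qquad (73)$$

and also

$$E[\mathcal{O}^{d-1,d-1}(\mathcal{U}_m)] = O(\log m), \quad E[\mathcal{O}^{d-1,\alpha}(\mathcal{U}_m)] \to \mu(d-1, \alpha) \quad (\alpha > d-1), \qquad (74)$$

for some positive constant $\mu(d-1, \alpha)$: this notation coincides with Proposition 2.1 of [33].
The particular values $\mu(1, \alpha) = \frac{2}{\alpha(\alpha+1)} \bigl(1 + \frac{2^{-\alpha}}{\alpha-1}\bigr)$ for $\alpha > 1$ were given in Proposition 2.1
of [24]. Thus by (73), if $\alpha < d-1$,

$$E[n^{(\alpha/d)-1} \mathcal{O}^{d-1,\alpha}(\mathcal{U}_{\lceil n t_n + n^{1/4} \rceil})] = O\bigl(n^{-(1/2)-\varepsilon+\alpha(d-2+2\varepsilon d)/(2d(d-1))}\bigr) \to 0,$$

as $n \to \infty$, for $\varepsilon$ small. Also, for $\alpha \in [d-1, d)$

$$E[n^{(\alpha/d)-1} \mathcal{O}^{d-1,\alpha}(\mathcal{U}_{\lceil n t_n + n^{1/4} \rceil})] \to 0,$$

as $n \to \infty$, by (74). Also by Cauchy-Schwarz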



(75) 



as $n \to \infty$. So from (72) this completes the proof of (70).

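For the proof of (71), let $A_n'$ denote the event $\{\beta_n < n t_n - n^{1/4}\}$. Then by Chernoff
tail bounds again, $P(A_n') = O(e^{-C n^{\varepsilon}})$. We have that there is a constant $C' \in (0, \infty)$ such
that for all $n$

$$\mathcal{O}^{d-1,\alpha}(\mathcal{U}_{\lfloor n t_n - n^{1/4} \rfloor}) - \mathbf{1}_{A_n'} C' n \leq \mathcal{O}^{d-1,\alpha}(\mathcal{V}_n) \leq \mathcal{O}^{d-1,\alpha}(\mathcal{U}_{\lceil n t_n + n^{1/4} \rceil}) + \mathbf{1}_{A_n} C' N(n). \qquad (76)$$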



Suppose $\alpha \geq d > d-1$. Then by (74) and (75) we have that the expectations of both the
lower and upper bounds in (76) converge to $\mu(d-1, \alpha)$. Thus we have (71). □



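Proof of Theorem 2.1. Consider

$$\mathcal{L}^{d,\alpha}(\mathcal{P}_n) = \mathcal{L}^{d,\alpha}(\mathcal{P}_n; \Gamma_n) + \mathcal{L}^{d,\alpha}(\mathcal{P}_n; B_n) + \mathcal{L}^{d,\alpha}(\mathcal{P}_n; I_n). \qquad (77)$$

First suppose $\alpha \in (0, d)$. We have

$$E[n^{(\alpha/d)-1} \mathcal{L}^{d,\alpha}(\mathcal{P}_n; B_n)] = E[n^{(\alpha/d)-1} \mathcal{O}^{d-1,\alpha}(\mathcal{V}_n)] + n^{(\alpha/d)-1} E[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; B_n) - \mathcal{O}^{d-1,\alpha}(\mathcal{V}_n)]. \qquad (78)$$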



From (70) we have that the first term on the right-hand side of (78) tends to zero as $n \to \infty$
for $\alpha \in (0, d)$. By (32) we have that for $\alpha \in (0, 1)$ the second term on the right-hand side
of (78) is $O(n^{\alpha((1/d)-(1/2)-\varepsilon)-(1/2)-\varepsilon})$, which tends to zero for $d \geq 2$, and (31) yields the same
result for $\alpha \geq 1$. Thus for any $\alpha \in (0, d)$, we have that $n^{(\alpha/d)-1} \mathcal{L}^{d,\alpha}(\mathcal{P}_n; B_n)$ tends to zero in
$L^1$. Then multiplying both sides of (77) by $n^{(\alpha/d)-1}$ and applying Lemma 4.1 and (64) we
obtain the $\alpha \in (0, d)$ case of Theorem 2.1.

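Now suppose $\alpha \geq d$. We have

$$E[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; B_n)] = E[\mathcal{O}^{d-1,\alpha}(\mathcal{V}_n)] + E[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; B_n) - \mathcal{O}^{d-1,\alpha}(\mathcal{V}_n)]. \qquad (79)$$

By (31) the last term on the right of (79) tends to zero as $n \to \infty$, since $\alpha \geq d > 1$. Also,
(71) says that the first term on the right of (79) tends to $\mu(d-1, \alpha)$. Thus for $\alpha \geq d$

$$E[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; B_n)] \to \mu(d-1, \alpha).$$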



Also, Lemma 4.1 implies that, for $\alpha > d$

$$E[\mathcal{L}^{d,\alpha}(\mathcal{P}_n; \Gamma_n)] \to 0,$$

and for $\alpha = d$



$$E[\mathcal{L}^{d,d}(\mathcal{P}_n; \Gamma_n)] \to 2 v_d^{-1}.$$



Then taking expectations in (77) and using (65) gives (3). This completes the proof of
Theorem 2.1. □